The invention relates to DNA which codes for β-tubulin from nematodes of the
family of the Strongylidae, the polypeptide encoded by this DNA, the use of the
DNA for the diagnosis of anthelmintic resistance of these nematodes and for the
identification of the species of these nematodes, the use of the β-tubulin as a
constituent of a vaccine, and a process for the identification of new anthelmintic or
antibiotic compounds.



Parasitic helminths (worms) are a health problem for humans and animals and cause 
significant economic damage. In 1996, the expenditure for anthelmintics worldwide 
was more than 2 billion US dollars. The most important anthelmintics which are 
presently used can be divided into three groups according to their mechanism of 
action: 



1. The cyclic amidines pyrantel and morantel act, together with the
imidazothiazoles tetramisole and levamisole, as cholinergic agonists on
the nervous system of the parasite.

2. The benzimidazoles are inhibitors of the polymerization of microtubules and
lead to the degradation of tubulin, followed by the loss of a number of cell
functions such as transport within cells and cell division.

3. The macrocyclic lactones bind to and open glutamate-gated chloride channels and
thus act as inhibitors of the nervous system of nematodes and arthropods.

Microtubules are built up from tubulin subunits. Tubulin is a dimeric protein which
consists of α- and β-tubulin and is in a dynamic equilibrium between tubulin and
microtubules. This equilibrium can be influenced by exogenous substances, which are
designated as microtubule inhibitors. Some of these inhibitors, such as the
benzimidazoles, act by binding to tubulin, whereby they prevent the
self-association of these subunits onto growing microtubules, while at the opposite
end the dissociation of the microtubules continues. As a result, processes within
the cell which are essential to life are disrupted, and finally the cell and the
entire organism die (Lacey, E. (1990) Mode of action of benzimidazoles.
Parasitology Today 6, 112-115). Such microtubule inhibitors include
various classes of compound which are synthetically prepared or produced by various
organisms.

The binding of microtubule inhibitors to tubulin from various organisms shows great
differences with respect to the affinity of the binding. Thus the anthelmintics
oxfendazole and thiabendazole show a high affinity for tubulin from Ascaridia galli
and only a slight affinity for tubulin from mammals such as the sheep (Dawson et
al. (1983) Purification and characterisation of tubulin from the parasitic nematode,
Ascaridia galli, Molecular and Biochemical Parasitology 7, 267-277). The selective
toxicity of benzimidazoles can be explained by means of this selective affinity
(Lacey, E. (1988) The role of the cytoskeletal protein, tubulin, in the mode of action
and mechanism of drug resistance to benzimidazoles, International Journal for
Parasitology 18, 885-936).

The widespread use of these anthelmintics has led to considerable resistance
problems against all three classes, especially in livestock (Bauer et al. (1994)
Anthelmintic resistance in nematodes of farm animals. A seminar organised for the
European Commission, Brussels, Belgium, 8 to 9 November 1993, pp. 17-24). The
most widespread class of anthelmintics is the benzimidazoles. Resistance to
benzimidazoles has been described worldwide in the case of parasites of sheep,
cattle, pigs and horses. Benzimidazoles are broad-spectrum anthelmintics with action
against nematodes, cestodes and trematodes. Investigations in German stud farms
showed a resistance of small strongylids to benzimidazoles in more than 80% of the
cases (Ullrich et al. (1988) Benzimidazole resistance in small strongylids
(Cyathostominae): distribution in horse stock in Northrhine-Westphalia, Berliner und
Münchener Tierärztliche Wochenschrift 101, 406-408).

For effective treatment with anthelmintics, it is therefore of great importance to
obtain information about possible resistance of worms in a horse or within a herd of
horses. To check the possible resistance of a worm population of small
strongylids, diagnostic procedures have already been developed which are based on
the activity of anthelmintics on the developmental stages of the parasites (eggs,
larvae) (Coles et al. (1992) World Association for the Advancement of Veterinary
Parasitology (W.A.A.V.P.) Methods for the detection of anthelmintic resistance in
nematodes of veterinary importance, Veterinary Parasitology 44, 35-44). The "larval
development assay" (LDA) describes the inhibitory effect of anthelmintics on the
nonparasitic first larval stage as a function of the anthelmintic concentration
employed. In a similar manner, the second approach, the "egg hatch assay" (EHA),
uses the inhibitory action of the anthelmintics on the hatching of the larvae as a
function of the anthelmintic concentration employed. A disadvantage of both
approaches lies in the low reproducibility of the results. Moreover, both approaches
are time-consuming and labor-intensive.

The sensitivity of both approaches is moreover low. An existing resistance is only
detected if more than 25% of the population is resistant (Roos et al. (1995) New
genetic and practical implications of selection for anthelmintic resistance in parasitic
nematodes, Parasitology Today 11, 148-150).

There is therefore an urgent need for diagnostic procedures which are sensitive, rapid
and reproducible. For the sheep parasites Haemonchus contortus and Teladorsagia
circumcincta, a procedure based on the PCR technique has been described. This is
based on at least one point mutation in codon 200 of the β-tubulin isotype 1 gene
(Elard et al. (1999) PCR diagnosis of benzimidazole-susceptibility or -resistance in
natural populations of the small ruminant parasite, Teladorsagia circumcincta,
Veterinary Parasitology 80, 231-237). The point mutations in codon 200 result in an
amino acid replacement of phenylalanine by tyrosine and correlate with
benzimidazole resistance of the mutated protein (Kwa et al. (1995) β-tubulin genes
from parasitic nematode Haemonchus contortus modulate drug resistance in
Caenorhabditis elegans, Journal of Molecular Biology 246, 500-510).
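
The codon 200 criterion described above can be illustrated with a short check. The following is a minimal sketch only, assuming an in-frame β-tubulin coding sequence that begins at codon 1 and the standard genetic code; the function names and the synthetic test sequences are hypothetical illustrations and are not taken from the cited publications.

```python
# Codons of the standard genetic code for the two amino acids of interest.
PHE_CODONS = {"TTT", "TTC"}  # phenylalanine
TYR_CODONS = {"TAT", "TAC"}  # tyrosine


def codon_at(cds: str, codon_number: int) -> str:
    """Return the codon with the given 1-based number from an in-frame CDS."""
    if codon_number < 1:
        raise ValueError("codon numbers start at 1")
    start = (codon_number - 1) * 3
    codon = cds[start:start + 3].upper()
    if len(codon) != 3:
        raise ValueError("coding sequence does not contain this codon")
    return codon


def classify_codon_200(cds: str) -> str:
    """Report whether codon 200 encodes Phe, Tyr or something else."""
    codon = codon_at(cds, 200)
    if codon in PHE_CODONS:
        return "Phe"
    if codon in TYR_CODONS:
        return "Tyr"
    return "other"


# Synthetic sequences (not real tubulin): start codon, 198 filler codons,
# then codon 200, then a few more filler codons.
wild_type = "ATG" + "GCT" * 198 + "TTC" + "GCT" * 5
mutant = "ATG" + "GCT" * 198 + "TAC" + "GCT" * 5

print(classify_codon_200(wild_type))  # Phe
print(classify_codon_200(mutant))     # Tyr
```

A single nucleotide change (TTC to TAC) is enough to switch the classification, which is why a point mutation at this position can serve as a diagnostic marker.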

From studies on Haemonchus contortus, it is known that the sequences of nematodes
coding for tubulin are species-specific (WO 92/03549).

The various species of the small strongylids of the horse differ in their
epidemiological frequency. The species which are most important and found most
frequently worldwide include Cylicocyclus nassatus, Cyathostomum coronatum and
Cyathostomum catinatum. Resistances of these species to benzimidazoles have been
described in various countries, inter alia also in Germany (Burger, H.-J. and Bauer,
C. (1987) Efficacy of four anthelmintics against benzimidazole-resistant
cyathostomes of horses, Veterinary Record 120, 293-296).

Nucleic acid sequencing of β-tubulin cDNAs from the species of small
strongylids according to the invention showed an identity of over 95% between these
sequences. The identity with the known, abovementioned β-tubulin sequences of sheep
parasites, however, is only 75.4-82.6%.

The derived amino acid sequences are very similar within the sequences according to
the invention. This is also true of the derived β-tubulin amino acid sequences of the
sheep parasites. The identity here is between 95 and 99.8%. Only very few positions
result in which amino acid exchange occurs. To be emphasized here is codon 200, in
which a change from phenylalanine to tyrosine results.

However, the noncoding β-tubulin sequences from various helminth species
published hitherto show no significant identity. It is accordingly surprising that not
only the coding sequences, but also parts of the noncoding sequences, of the various
species of small strongylids of the present application have a high identity.
These regions are therefore also suitable for differentiating various species of
small strongylids and other nematode species from one another. PCR primers which
are derived from these intron regions can be used for the specific detection of small
strongylids within a sample which also contains genetic material from other helminth
organisms.

β-Tubulin from nematodes, or parts thereof, is known to have a protective
immunological potential (Bughio et al. (1993) Characterisation and biological
activities of anti-Brugia pahangi tubulin monoclonal antibodies, International
Journal for Parasitology, 7, 913-924). The β-tubulin of the small strongylids encoded
by the abovementioned DNA can be used as a vaccine just as monoclonal antibodies
can be used against the β-tubulin.

Inhibitors of the interaction of the tubulin or of its subunits, such as benzimidazoles,
colchicine and taxol, are important lead structures of a series of therapeutics which
are directed against human, animal or plant diseases. The importance of tubulin as
the target of these compounds is far-reaching and underlines its potential for the
search for new active compounds for the control of these diseases.

The invention relates to the DNA coding for the β-tubulin from nematodes of the
family of the Strongylidae, particularly of the subfamily of the Cyathostominae, or
fragments of this DNA. The β-tubulin DNA can in this case be genomic DNA or
cDNA. The DNA sequences which this invention relates to can be considered as new
members of the tubulin gene family of parasitic nematodes of the order Strongylida,
particularly of the subfamily of the Cyathostominae. The invention relates very
particularly to the DNA sequences which code for β-tubulin from parasitic
nematodes of the genera Cyathostomum and Cylicocyclus.

The invention likewise relates to DNA sequences which have an identity of more
than 85% to a polynucleotide coding for one of the amino acid sequences as set forth
in SEQ ID NO. 2, 4, 6, 8 or 10.

The invention likewise relates to preferred DNA sequences which have an identity of
more than 95% to a polynucleotide coding for one of the amino acid sequences as set
forth in SEQ ID NO. 2, 4, 6, 8 or 10.

The invention relates in particular to DNA sequences coding for β-tubulin, which
originate from parasitic nematodes of the genera Cylicocyclus and Cyathostomum,
very particularly those sequences which originate from parasitic nematodes of the
species Cylicocyclus nassatus, preferably DNA as set forth in SEQ ID NO. 3, 5, 7, 9
or 11, or from Cyathostomum coronatum, preferably DNA according to SEQ ID NO. 1.

The invention likewise relates to DNA sequences as described above, which in
contrast to these sequences have at least one point mutation or one nucleotide
replacement in codon 200. These point mutations result in a change in the amino acid
sequence encoded by this DNA, e.g. a replacement of the amino acid phenylalanine
by tyrosine, and correlate with the resistance of tubulin with appropriate mutations to
benzimidazoles.

The invention likewise relates to DNA sequences which are complementary to the
DNA described above or fragments of this DNA, and to fragments of these DNA
sequences. These DNA sequences or these fragments comprise oligonucleotides
which are derived from one of the DNA sequences mentioned above or described
under SEQ ID NO. 1, 3, 5, 7, 9 or 11 or sequences 85% identical thereto,
preferably 95% identical, or are derived from strands complementary
thereto and can hybridize to these.
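
The relationship between an oligonucleotide and the strand it can hybridize to may be sketched as follows. This is a simplified illustration that assumes perfect base pairing with no mismatches; real hybridization also depends on temperature, salt concentration and tolerated mismatches. The function names and the short test sequences are hypothetical and are not sequences of the invention.

```python
# Translation table for Watson-Crick complements.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.translate(COMPLEMENT)[::-1]


def find_all(text: str, pattern: str) -> list:
    """Return all 0-based start positions of pattern in text (overlaps allowed)."""
    positions = []
    index = text.find(pattern)
    while index != -1:
        positions.append(index)
        index = text.find(pattern, index + 1)
    return positions


def hybridization_sites(strand: str, oligo: str) -> dict:
    """Locate perfect-match binding sites of an oligo, given one strand 5'->3'.

    An oligo anneals to the given strand where that strand contains the
    reverse complement of the oligo. It anneals to the complementary strand
    where the given strand contains the oligo sequence itself.
    """
    strand = strand.upper()
    oligo = oligo.upper()
    return {
        "given_strand": find_all(strand, reverse_complement(oligo)),
        "complementary_strand": find_all(strand, oligo),
    }


strand = "ATGCCGTAAGCT"  # arbitrary example strand
print(hybridization_sites(strand, "CCGT"))
# {'given_strand': [], 'complementary_strand': [3]}
print(hybridization_sites(strand, "TTAC"))
# {'given_strand': [5], 'complementary_strand': []}
```

A PCR primer pair consists of one oligonucleotide of each kind, so that the two primers anneal to opposite strands and flank the region to be amplified.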

The invention in this case relates preferably to oligonucleotides consisting of or
comprising one of the sequences as set forth in SEQ ID NO. 12 to SEQ ID NO. 51,
which hybridize to abovementioned DNA sequences, preferably in the region of
noncoding sequence sections of the β-tubulin genes.

The invention likewise preferably relates to oligonucleotides consisting of or
comprising one of the sequences as set forth in SEQ ID NO. 12 to SEQ ID NO. 51,
which hybridize to coding regions of the abovementioned sequences.

The invention likewise relates to RNA sequences which are complementary to the
DNA described above or fragments of this DNA, and fragments of these RNA
sequences. These RNA sequences or these fragments comprise ribooligonucleotides
which correspond to a region of one of the DNA sequences mentioned above or
described under SEQ ID NO. 1, 3, 5, 7, 9 or 11, sequences complementary thereto or
DNA sequences 85% identical, preferably 95% identical, thereto, and can hybridize
to these.

The invention likewise relates to an expression construct which comprises one of the
DNA sequences described above and a DNA sequence linked thereto which
makes possible the expression of the DNA. These include, for example, at least one
promoter for constitutive or inducible expression or alternatively enhancers. Suitable
promoters for expression in E. coli are natural, hybrid or bacteriophage promoters,
preferably promoters of the group of λ-phages, hsp, omp or synthetic promoters such
as mentioned, for example, in WO 98/5625, DE 3 430 683 or EP 0 173 149.

The invention likewise relates to vectors which comprise one of the DNA sequences
described above and make possible the expression of the β-tubulin according to the
invention or fragments thereof in a host cell.

The invention likewise relates to host cells which contain the abovementioned DNA,
an expression construct as mentioned above, or a vector and allow the expression of
the β-tubulin or fragments thereof.

The invention likewise relates to polypeptides which are encoded by one of the
abovementioned DNA sequences or fragments of these DNA sequences, and to
fragments of these polypeptides.

The invention in this case relates preferably to polypeptides which are encoded by a
DNA sequence comprising SEQ ID NO. 1, 3, 5, 7, 9 or 11, by DNA sequences which
have an identity of 85% to these sequences, preferably of 95%, or by fragments of
this DNA.

The invention also relates to polypeptides which are encoded by the DNA sequence
described above, which contain at least one point mutation in codon 200 as described
above and show resistance to benzimidazoles, and to fragments of these
polypeptides.

The invention in this case relates very particularly preferably to polypeptides
comprising one of the amino acid sequences described in SEQ ID NO. 2, 4, 6, 8 or 10
or fragments thereof.

The invention in this case relates to polypeptides, particularly to purified
polypeptides or polypeptides prepared recombinantly.

The invention relates to polypeptides of full length and also to corresponding
fragments of these polypeptides, e.g. certain motifs or domains. These fragments can
be of different length and comprise, for example, 5, 10, 25, 50, 100, 150, 200, 250 or
300 amino acids.

This invention likewise relates to fusion proteins which comprise a polypeptide as
described above. The fusion protein can in this case contain a further polypeptide
component which is unrelated to the β-tubulin (e.g. LexA, B42, glutathione
S-transferase, a His tag, a polypeptide having enzymatic activity such as alkaline
phosphatase or an epitope tag).

The invention also relates to a process for the preparation of a polypeptide as
described above in suitable prokaryotic or eukaryotic expression systems. The
expression can in this case be carried out permanently or transiently as described
above in a corresponding cell line in each case or corresponding host cells. Suitable
prokaryotic expression systems are known host-vector systems such as bacteria (e.g.
Streptomyces spp., Bacillus subtilis, Salmonella typhimurium, Serratia marcescens
and particularly Escherichia coli).

Expression in a eukaryotic system is preferably carried out in the baculovirus system,
particularly in a system which allows the introduction of post-translational
modifications.

This invention likewise relates to the use of DNA as mentioned above for the
detection of DNA from nematodes of the family Strongylidae, preferably of the
subfamily Cyathostominae, particularly preferably of the genera Cyathostomum and
Cylicocyclus, very particularly preferably of the species Cyathostomum coronatum
and Cylicocyclus nassatus. The invention in this case relates to oligonucleotides as
mentioned above which are complementary to DNA coding for β-tubulin or strands
complementary thereto and can hybridize to this DNA. Preferably, these
oligonucleotides hybridize to the intron regions, i.e. the noncoding DNA sequences.
The invention relates to the use of these oligonucleotides or parts thereof as

a) probes in Northern or Southern blot assays,

b) PCR primers in a diagnostic procedure for the detection of the
abovementioned nematodes, the DNA of the nematodes concerned being
specifically identified and amplified with the aid of the primer and the PCR
technique.

The invention in this case relates preferably to oligonucleotides consisting of or
comprising one of the sequences as set forth in SEQ ID NO. 12 to SEQ ID NO. 51.

The invention likewise relates to the use of DNA as mentioned above for the 
detection of DNA from nematodes of the family of the Strongylidae, preferably of the 
subfamily of the Cyathostominae, particularly preferably of the genera Cylicocyclus 
and Cyathostomum, very particularly preferably of the species Cylicocyclus nassatus 
and Cyathostomum coronatum, which codes for β-tubulin or fragments thereof,
which are resistant to benzimidazoles. The invention in this case relates to
oligonucleotides as mentioned above which are complementary to DNAs which code
for β-tubulin with a resistance to benzimidazoles or to the complementary strands of
this DNA and which can specifically hybridize to this DNA. 

The invention also relates to the use of these oligonucleotides or parts thereof as 

a) probes in Northern or Southern blot assays,

b) PCR primers in a diagnostic procedure for the detection of the 
abovementioned nematodes with a resistance to benzimidazoles, the DNA of 
the nematodes concerned being specifically identified and amplified with the 
aid of the primer and the PCR technique. 

The invention in this case preferably relates to oligonucleotides consisting of or 
comprising one of the sequences as set forth in SEQ ID NO. 12 to SEQ ID NO. 51. 

The invention also relates to a procedure for the detection of nematodes of the family
of the Strongylidae, preferably of the subfamily of the Cyathostominae, particularly
preferably of the genera Cylicocyclus and Cyathostomum, very particularly
preferably of the species Cylicocyclus nassatus and Cyathostomum coronatum, where
oligonucleotides as described above specifically hybridize to DNA sequences which
originate from the organisms mentioned, and which can be amplified with the aid of
the PCR technique. The hybridization is preferably carried out in the noncoding
regions of the β-tubulin gene (introns).

The detection of organisms as mentioned above can be carried out, for example, by

a) making available an oligonucleotide probe or primer which can hybridize to
the abovementioned DNA coding for β-tubulin or strands complementary
thereto or to the 5'- or 3'-flanking regions thereof,

b) bringing the oligonucleotide probe or the primers into contact with an
appropriately prepared sample containing DNA,

c) detecting the hybridization of the oligonucleotide or primer (e.g. with the aid
of the polymerase chain reaction),

d) sequencing the detected sequence of the β-tubulin gene, and

e) comparing this sequence with the DNA sequences according to the invention
which were described above, preferably with DNA sequences as set forth in
SEQ ID NO. 1, 3, 5, 7, 9 or 11.



The invention also relates to a process for the detection of nematodes of the family of
the Strongylidae, preferably of the subfamily of the Cyathostominae, particularly
preferably of the genera Cylicocyclus and Cyathostomum, very particularly
preferably of the species Cylicocyclus nassatus and Cyathostomum coronatum, which
are resistant to benzimidazoles, where oligonucleotides as described above
specifically hybridize to DNA sequences which originate from the organisms
mentioned and which can be amplified with the aid of the PCR technique. The
hybridization is preferably carried out in the noncoding regions of the β-tubulin gene
(introns).

The detection of organisms as mentioned above can be carried out, for example, by

a) making available an oligonucleotide probe or primer which can hybridize to
the abovementioned DNA coding for β-tubulin or strands complementary
thereto or to the 5'- or 3'-flanking regions thereof,

b) bringing the oligonucleotide probe or the primers into contact with an
appropriately prepared sample containing DNA,

c) detecting the hybridization of the oligonucleotide or primer (e.g. with the aid
of the polymerase chain reaction),

d) sequencing the detected sequence of the β-tubulin gene, and

e) comparing this sequence with the DNA sequences according to the invention
which were described above, preferably with DNA sequences as set forth in
SEQ ID NO. 1, 3, 5, 7, 9 or 11, having at least one point mutation in codon
200, which leads to a resistance of the β-tubulin encoded by these sequences
to benzimidazoles.
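
The comparison in step e) can be sketched as a codon-by-codon check of a sequenced fragment against a reference. This is a minimal illustration under stated assumptions: both sequences are in frame, ungapped and of equal length, and the codon number of the first codon in the fragment is known. The function name and the short sequences are hypothetical and do not reproduce any SEQ ID NO.

```python
def codon_differences(reference: str, sample: str, first_codon: int = 1) -> list:
    """List (codon_number, reference_codon, sample_codon) for differing codons.

    Both sequences must be in frame, ungapped and of equal length;
    first_codon is the codon number of the first codon in the fragment.
    """
    reference = reference.upper()
    sample = sample.upper()
    if len(reference) != len(sample) or len(reference) % 3 != 0:
        raise ValueError("sequences must be equal-length multiples of three")
    differences = []
    for offset in range(0, len(reference), 3):
        ref_codon = reference[offset:offset + 3]
        sample_codon = sample[offset:offset + 3]
        if ref_codon != sample_codon:
            differences.append((first_codon + offset // 3, ref_codon, sample_codon))
    return differences


# Synthetic fragment covering codons 198 to 202 (filler codons around codon 200).
reference_fragment = "GCT" + "GCT" + "TTC" + "GCT" + "GCT"
sample_fragment = "GCT" + "GCT" + "TAC" + "GCT" + "GCT"

print(codon_differences(reference_fragment, sample_fragment, first_codon=198))
# [(200, 'TTC', 'TAC')]
```

In practice the sequenced amplicon would first have to be aligned to the reference; the sketch skips that step and only shows how a difference at codon 200 would be reported once the two sequences are in register.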

The invention likewise relates to a diagnostic test kit for the detection and
identification of nematodes of the family of the Strongylidae, preferably of the
subfamily of the Cyathostominae, particularly preferably of the genera Cylicocyclus
and Cyathostomum, very particularly preferably of the species Cylicocyclus nassatus
and Cyathostomum coronatum, which, inter alia, makes available oligonucleotides as
described above, which can be used in procedures for the detection of the species
mentioned. The present invention likewise makes available oligonucleotides which
specifically hybridize to sequences as set forth in SEQ ID NO. 1, 3, 5, 7, 9 or 11,
sequences complementary thereto, sequences having at least one point mutation in
codon 200, or fragments of these sequences. In this case, oligonucleotides consisting
of or including sequences as set forth in SEQ ID NO. 12 to SEQ ID NO. 51 are
particularly preferred.

The invention likewise relates to a diagnostic test kit for the detection of nematodes
of the family of the Strongylidae, preferably of the subfamily of the Cyathostominae,
particularly preferably of the genera Cylicocyclus and Cyathostomum, very
particularly preferably of the species Cylicocyclus nassatus and Cyathostomum
coronatum, having a resistance to benzimidazoles which, inter alia, makes available
oligonucleotides as described above, which can be used in procedures for the
detection of the species mentioned. The present invention likewise makes available
oligonucleotides which hybridize specifically to sequences as set forth in SEQ ID
NO. 1, 3, 5, 7, 9 or 11, sequences complementary thereto, sequences having at least
one point mutation in codon 200, or fragments of these sequences. In this case,
oligonucleotides consisting of or comprising sequences as set forth in SEQ ID NO.
12 to SEQ ID NO. 51 are particularly preferred.

The invention likewise relates to a diagnostic test kit as described above, where the
oligonucleotides made available in this kit are provided with a detectable marker.
Such detectable markers can include, inter alia, enzymes, enzyme substrates,
coenzymes, enzyme inhibitors, fluorescence markers, chromophores, luminescent
markers and radioisotopes.

This invention likewise relates to antibodies which react specifically with an epitope
of a β-tubulin from nematodes of the family of the Strongylidae, preferably of the
subfamily of the Cyathostominae, particularly preferably of the genera Cylicocyclus
and Cyathostomum, very particularly preferably of the species Cylicocyclus nassatus
and Cyathostomum coronatum.

This invention likewise particularly relates to monoclonal antibodies which react
specifically with an epitope of a β-tubulin from nematodes of the family of the
Strongylidae, preferably of the subfamily of the Cyathostominae, particularly
preferably of the genera Cylicocyclus and Cyathostomum, very particularly
preferably of the species Cylicocyclus nassatus and Cyathostomum coronatum.

This invention likewise relates to the use of the abovementioned antibodies as
nematicides.

This invention also relates to the use of the abovementioned β-tubulin polypeptides
or fragments thereof from nematodes of the family of the Strongylidae, preferably of
the subfamily of the Cyathostominae, particularly preferably of the genera
Cylicocyclus and Cyathostomum, very particularly preferably of the species
Cylicocyclus nassatus and Cyathostomum coronatum, for the preparation of vaccines
which contain at least one of the β-tubulin polypeptides mentioned or fragments
thereof. The vaccine is in this case able to produce an immune response which is
specific for a β-tubulin described above.

In a preferred embodiment, the vaccine contains an antigenic determinant, e.g. an
individual determinant of a polypeptide having an amino acid sequence as set forth in
SEQ ID NO. 2, 4, 6, 8 or 10 or of a polypeptide which is encoded by one of the
abovementioned DNAs or fragments thereof.

The invention likewise relates to a procedure for the preparation of an immunogenic
composition for the immunization of mammals, consisting of at least one of the
abovementioned β-tubulin polypeptides according to the invention or fragments
thereof or of at least one of the abovementioned antibodies.

The invention likewise relates to the use of the expression vectors described above
containing a nucleic acid coding for a β-tubulin according to the invention, preferably
a sequence as set forth in SEQ ID NO. 1, 3, 5, 7, 9 or 11, fragments thereof or
sequences homologous thereto for the preparation of an immunogenic composition
for administration to a host for the activation of a protective immune response in this
host, which is directed at β-tubulin from parasitic nematodes.

The invention likewise relates to an immunogenic composition comprising a vector
(comprising a nucleic acid coding for the β-tubulin according to the invention,
preferably a sequence as set forth in SEQ ID NO. 1, 3, 5, 7, 9 or 11, fragments
thereof or sequences homologous thereto and a promoter sequence which is linked
functionally to said nucleotide sequence and which controls the expression of the
β-tubulin according to the invention producing an immune response) and a vehicle
suitable for pharmaceutical purposes.



This invention likewise relates to a process for the identification of substances which
modulate the interaction of tubulin or the interaction of subunits of tubulin. The
process is based on the use of tubulin, preferably on tubulin from parasitic
nematodes, particularly preferably on β-tubulin from parasitic nematodes of the order
Strongylida, very particularly preferably on β-tubulin from parasitic nematodes of the
family of the Strongylidae, most preferably on β-tubulin from parasitic nematodes of
the subfamily of the Cyathostominae. A particularly preferred group of the β-tubulin
used in this procedure is β-tubulin from parasitic nematodes of the genera
Cylicocyclus and Cyathostomum.

The invention relates to the identification of substances, e.g. small organic molecules,
which are able to modulate the interaction of tubulin protein molecules or their subunits
with one another. Preferably, the invention relates to the identification of compounds
which inhibit the interaction.



The invention likewise relates to a procedure as described above, which is based on

a) bringing the substance to be tested into contact with the tubulin, the
conditions selected allowing the interaction of the tubulin molecules with one
another and the binding of the test substance to tubulin,

b) detecting the binding which has taken place by determining the ability of the
tubulin protein molecules to interact with one another and

c) comparing the ability of the tubulin protein molecules to interact with one
another in the presence of a test substance to the ability to interact with one
another in the absence of a test substance.
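
The comparison in step c) can be expressed as a relative change of a measured interaction signal. The following sketch assumes that the assay yields one numeric signal per condition that increases with tubulin-tubulin interaction; the function names and the 20% cut-off are hypothetical illustrations and are not values given in this description.

```python
def interaction_change_percent(signal_with: float, signal_without: float) -> float:
    """Relative change of the interaction signal caused by the test substance.

    Returned in percent of the control signal (without test substance).
    Negative values indicate inhibition, positive values enhancement.
    """
    if signal_without <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (signal_with - signal_without) / signal_without


def classify_substance(signal_with: float, signal_without: float,
                       threshold_percent: float = 20.0) -> str:
    """Label a test substance using a hypothetical cut-off (assumption)."""
    change = interaction_change_percent(signal_with, signal_without)
    if change <= -threshold_percent:
        return "inhibitor"
    if change >= threshold_percent:
        return "enhancer"
    return "no clear effect"


# Example readings in arbitrary units.
print(interaction_change_percent(50.0, 200.0))  # -75.0
print(classify_substance(50.0, 200.0))          # inhibitor
print(classify_substance(190.0, 200.0))         # no clear effect
```

A real screen would use replicate measurements and a statistically justified cut-off; the fixed threshold here only serves to show the direction of the comparison.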

The invention likewise relates to a process for the identification of substances which
modulate the ability of tubulin molecules to interact with one another. Particularly
preferably, the invention in this case relates to a procedure which uses one of the
polypeptides described above which are encoded by the DNAs described above or
fragments thereof, particularly by DNAs consisting of or comprising sequences as set
forth in SEQ ID NO. 1, 3, 5, 7, 9 or 11, and by sequences which have an identity of
85% thereto, preferably of 95%, and code for β-tubulin which has an amino acid
sequence as set forth in SEQ ID NO. 2, 4, 6, 8 or 10.

The invention likewise relates to a procedure for the identification of substances
which modulate the ability of tubulin molecules to interact with one another as
described above, the procedure used being based on detecting a modulation of the
tubulin interaction in the presence of a test substance with the aid of a cell-based
test system. A preferred embodiment of such a test system is the "two hybrid system"
(US 5 283 317; Zervos et al. (1993) Cell 72, 223-232; WO 94/10300). This system is
suitable for documenting or describing the interaction of two proteins, the
interaction leading to a detectable signal. Such a system can also be adapted to
high-throughput test systems.

The invention likewise relates to a procedure for the identification of substances
which modulate the ability of tubulin molecules to interact with one another, the
procedure used being based on detecting a modulation of the tubulin interaction in
the presence of a test substance with the aid of a cell-free test system. A
particularly preferred embodiment of such a test system is the "scintillation
proximity assay" (SPA) (EP 015 473). This test system is based on the detection of
an interaction of a receptor bound to microspheres or beads, e.g. of a tubulin
molecule, with a ligand, the microspheres or beads being provided with a
scintillating molecule. A signal is detected if the receptor-ligand complex
decomposes.

The invention likewise relates to substances which have not hitherto been
described, which are identified with the aid of the procedure described above and are
suitable to modulate, preferably to inhibit, the interaction of tubulin molecules.

The invention likewise relates to the use of substances which have not hitherto
been described, which were identified using one of the procedures described above,
for the preparation of an agent which is used for the prophylactic or therapeutic
treatment of animals or humans which can be attacked or have been attacked by
nematodes. The agents according to the invention contain at least one of the
substances identified by one of the procedures described above and can be
administered nasally, dermally, parenterally or enterally.

For better understanding, the meaning of certain words and terms which are used in
the description, the examples and the attached claims is explained in greater detail
below.

The term "fragments" with respect to proteins and DNA describes parts of the
nucleic acid or amino acid sequences described under SEQ ID NO. 1 to 11,
sequences complementary thereto or sequences identical thereto to 85%, preferably
to 95%. The fragments of the DNA and polypeptide sequences comprise at least five
nucleotides or amino acids, but can likewise comprise up to 447 amino acids or up to
1343 nucleotides, or up to 2565 nucleotides in the case of the sequence as set forth in
SEQ ID NO. 11.

The terms "homology", "identity" or "similarity" relate to sequence similarities
between two peptides or between two nucleic acid molecules. Homology can be
determined by comparing, in each case, a position in one sequence with the
corresponding position in the other. If a position in the compared sequences is
occupied by the same base or amino acid, the two molecules are homologous at this
position. The measure of homology between sequences is a function of the number of
corresponding or homologous positions which the sequences share with one another.
A "nonhomologous" sequence has an identity of less than 40%, preferably of less
than 25%.
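
The position-by-position comparison described above can be illustrated with a
minimal sketch. It assumes that the two sequences have already been aligned to the
same length; the function names and the treatment of gap characters are illustrative
assumptions and are not taken from the GCG program or any other specific tool.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned positions occupied by the same base or amino acid.

    Assumption: both sequences are pre-aligned and of equal length;
    gap characters ('-') are counted as non-identical positions.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


def meets_identity_threshold(seq_a: str, seq_b: str, threshold: float = 85.0) -> bool:
    """Check a pair against an identity threshold such as the 85% or 95% named in the text."""
    return percent_identity(seq_a, seq_b) >= threshold
```

For example, two aligned ten-base sequences differing at one position give an
identity of 90% and would satisfy the 85% criterion but not the 95% criterion.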



The term "homology" in particular means that DNA segments of a length of at least
15 base pairs, or strands complementary to the DNA, agree to at least 85%, preferably
95%, with the nucleotides of the corresponding DNA. A homology can be
determined, inter alia, with the aid of computer programs such as the GCG program
(Devereux et al. (1983), Nucleic Acids Res. 12, 387-395).

A "homology" also exists if a DNA segment can hybridize to the DNA strand
concerned or its complementary strand.



The term "hybridize" or "hybridization" describes the process in which a
single-stranded nucleic acid molecule undergoes base pairing with a complementary
DNA strand, the ability of a single-stranded nucleic acid molecule to do so depending
on the stringency of the hybridization conditions.



The term "stringency" relates to the hybridization conditions. "High stringency" is 
imparted if base pairing is rendered difficult. "Low stringency" is imparted if base 
pairing is facilitated. 

The term "complementary" relates to the ability of purine and pyrimidine nucleotides
to form base pairs with one another via hydrogen bonds. Complementary base pairs
are, inter alia, guanine and cytosine, adenine and thymine and adenine and uracil.
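
As a small illustration of the base pairs named above, the following sketch derives
the complementary strand of a DNA sequence and checks whether two strands can pair
over their whole length. The function names are illustrative assumptions; only the
four DNA bases are handled.

```python
# Watson-Crick pairs for DNA as named in the text: G-C and A-T.
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5'->3'."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna.upper()))


def is_complementary(strand: str, other: str) -> bool:
    """True if `other` (written 5'->3') pairs base by base with `strand`."""
    return reverse_complement(strand) == other.upper()
```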



The person skilled in the art realizes that, on account of the degenerate genetic code
(i.e. 64 codons code for 20 amino acids), numerous "silent" substitutions of
nucleotide base pairs can be introduced into the coding sequence without
changing the identity of the protein products encoded thereby. All such substitutions
are to be included in the scope of the invention.
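
What a "silent" substitution means can be shown with a short sketch based on the
standard genetic code. The sequences used are toy examples and not part of any
SEQ ID NO.; the function names are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def is_silent(original_cds: str, variant_cds: str) -> bool:
    """True if the variant differs in nucleotides but encodes the same protein."""
    return (
        original_cds.upper() != variant_cds.upper()
        and translate(original_cds) == translate(variant_cds)
    )
```

In this sketch, replacing the codon TTC by TTT is silent because both encode
phenylalanine, whereas replacing TTC by TAC changes the encoded amino acid.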

The term "specifically hybridize" relates to the ability of a nucleic acid molecule of
the present invention to hybridize to at least approximately 6, 12, 20, 30, 50, 100,
150, 200, 300, 350, 400 or 440 successive nucleotides of one of the β-tubulin genes
described above, preferably to one of the sequences as set forth in SEQ ID NO. 1, 3,
5, 7, 9 or 11 or sequences homologous or complementary thereto, namely in such a
way that ten times more molecules hybridize, preferably 100 times more molecules
hybridize, and particularly preferably more than 100 times more molecules hybridize
than to a cellular nucleic acid (e.g. mRNA or genomic DNA) which codes for a
protein other than the β-tubulin described above.

The term "plasmid" relates to an extrachromosomal genetic element. The original
plasmids used for the present invention are either commercially obtainable, freely
accessible or can be derived from such plasmids according to known processes.

The term "vector" describes a DNA element which is used for the introduction of
exogenous DNA into host cells. A vector contains a nucleotide sequence which codes
for one or more polypeptides. Vectors which are able to control the expression of the
genes which they contain are designated as "expression vectors".

The term "gene" relates in the scope of the present invention to a nucleic acid
comprising an open reading frame which codes for one of the β-tubulin polypeptides
described above. Exon and possible intron sequences are additionally included here.

The term "interact" or "interaction" describes detectable interactions between 
molecules. The term "binding" is additionally included here. 

The term "modulate" relates both to a stimulation and to a suppression or inhibition
of a biochemical process. In the context of the present invention, "modulation"
means an inhibition or suppression of the interaction between tubulin polypeptides or
fragments or subunits thereof, or a stimulation of this interaction which can
manifest itself, for example, in an irreversible binding of tubulin polypeptides to
one another.

The term "nucleic acid" relates to polynucleotides such as deoxyribonucleic acids
(DNA) or, if appropriate, to ribonucleic acids (RNA). In an equivalent manner, the
term also includes analogs of RNA or DNA which are prepared from nucleotide
analogs, and, in the case concerned, single-stranded ("sense" or "antisense") and
double-stranded polynucleotides.

The term "promoter" relates to DNA sequences which regulate the expression of a
specific DNA which is functionally linked to the promoter. The term also includes
"tissue-specific" promoters, i.e. promoters which control the expression of the
specific DNA only in certain cells (e.g. cells of a certain tissue). Likewise included
are "tissue-nonspecific" promoters and promoters which lead to a constitutive
expression or are inducible.

The terms "protein", "polypeptide" and "peptide" are used interchangeably in the
context of the present application if they relate to a gene product.

A "fusion protein" is a fusion of a first amino acid sequence representing one of the
β-tubulin polypeptides described above with a second amino acid sequence which
has no commonality or fundamental homology to the tubulin sequence. The second
amino acid sequence can in this case originate from the same organism as the first, or
alternatively can originate from another organism (intergenic). In general, a fusion
protein can be represented by means of the formula X-tubulin-Y, where "tubulin"
represents one of the polypeptides described above, and X and Y each represent a
polypeptide which is not related to a tubulin amino acid sequence. X or Y can in
each case independently of one another be absent.

The terms "cell" or "host cell" can be used in the same sense in the context of the
present application. It is understood that these terms relate not only to an
individual cell, but also to the descendants of such a cell. On account of certain
modifications in the course of following generations (e.g. mutations), such
descendants are possibly not identical to the parent cell, but are additionally included
by the present invention.

The term "intron" describes those sequences of the described, preferably genomic,
DNA which are transcribed, but are then removed from the transcript by "splicing",
the adjacent sequences (exons) being linked.

Nucleic acids 

As already described, one aspect of the invention relates to nucleic acids from
nematodes of the family of the Strongylidae, particularly of the subfamily of the
Cyathostominae, very particularly of the genera Cyathostomum and Cylicocyclus,
especially of the species Cylicocyclus nassatus and Cyathostomum coronatum, which
code for β-tubulin polypeptides or fragments thereof, or nucleic acids homologous
thereto which are homologous to 85% to the sequences in SEQ ID NO. 1, 3, 5, 7, 9 and
11, preferably to 95%, and code for a β-tubulin according to one of the sequences as
set forth in SEQ ID NO. 2, 4, 6, 8 or 10 or fragments thereof. SEQ ID NO. 3
represents the degenerate sequence of the nucleic acid from Cylicocyclus nassatus
coding for β-tubulin, where "r" represents a purine (guanine or adenine), "y"
represents a pyrimidine (thymine, uracil or cytosine) and "w" represents an
adenine, a thymine or a uracil. SEQ ID NO. 3 thus includes a number of sequences
which can be present in the organisms of the species Cylicocyclus nassatus. The
sequences as set forth in SEQ ID NO. 5, 7 and 11 show three defined sequences
coding for β-tubulin, which are exemplary and preferred embodiments of DNA as set
forth in SEQ ID NO. 3.

Likewise part of the invention are oligonucleotides which optionally code for
β-tubulin polypeptides which comprise a length of at least 2, 5, 10, 25, 50, 100, 150,
200, 250, 300, 350 or 400 amino acids. Such oligonucleotides can be used as primers
or antisense molecules (i.e. as noncoding nucleic acids) and comprise at least
approximately 6, 12, 24, 30, 60, 100, 120, 150 or 210 base pairs, while coding
nucleic acids comprise approximately 200, 300, 400, 500, 600, 700, 800, 900, 1000,
1100, 1200 or 1300 base pairs.

The invention also describes those oligonucleotides which specifically hybridize
under stringent conditions to nucleic acids which are represented by one of the
sequences as set forth in SEQ ID NO. 1, 3, 5, 7, 9 or 11. Correspondingly stringent
conditions are, for example, 6 x sodium chloride/sodium citrate (SSC) at
approximately 45°C, followed by a washing step with 2 x SSC at 50°C, and are
familiar to the person skilled in the art (see, for example, Current Protocols in
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6). Thus the salt
concentrations in the washing step can be chosen such that the stringency is lower
(2 x SSC, 50°C) or is higher (0.2 x SSC, 50°C). Furthermore, the temperature in the
washing step can be varied, from conditions for a low stringency (e.g. about 22°C)
up to conditions of higher stringency (e.g. about 65°C). Both salt concentration and
temperature can be varied and tailored to one another.
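
How salt concentration and temperature act together on stringency can be
approximated with a minimal sketch. It assumes a commonly used empirical formula
for the melting temperature of longer DNA duplexes and assumes that 1 x SSC
corresponds to roughly 0.195 M sodium ions (0.15 M sodium chloride plus 0.015 M
trisodium citrate). Neither the formula nor the function names come from this
description, and the values are estimates only, not a substitute for empirical
optimization.

```python
import math

# Assumption: 1 x SSC = 0.15 M NaCl + 0.015 M trisodium citrate, i.e. about 0.195 M Na+.
SSC_1X_SODIUM_MOLAR = 0.195


def melting_temp_c(seq: str, ssc_fold: float) -> float:
    """Rough duplex melting temperature in deg C for a probe of length N.

    Assumed empirical formula (suited to longer probes, not short primers):
    Tm = 81.5 + 16.6 * log10([Na+]) + 0.41 * (%GC) - 675 / N
    """
    seq = seq.upper()
    n = len(seq)
    gc_percent = 100.0 * sum(seq.count(base) for base in "GC") / n
    sodium = ssc_fold * SSC_1X_SODIUM_MOLAR
    return 81.5 + 16.6 * math.log10(sodium) + 0.41 * gc_percent - 675.0 / n


def wash_margin_c(seq: str, ssc_fold: float, wash_temp_c: float) -> float:
    """Estimated Tm minus wash temperature; a smaller margin means a more stringent wash."""
    return melting_temp_c(seq, ssc_fold) - wash_temp_c
```

Under these assumptions, lowering the salt from 2 x SSC to 0.2 x SSC at a constant
50°C lowers the estimated melting temperature by 16.6°C, and raising the wash
temperature from 22°C to 65°C at constant salt reduces the margin by 43°C; both
changes make base pairing more difficult, i.e. increase the stringency.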

Particularly preferred oligonucleotides which can be used as primers or for
hybridization for the identification and characterization of an existing, e.g. genomic,
DNA are described in SEQ ID NO. 12-51. However, other oligonucleotides can also
be derived from the sequences as set forth in SEQ ID NO. 1, 3, 5, 7, 9 or 11, which
can then be used as primers or for hybridization. Particularly preferred for the
identification of the species or genus to which an existing DNA can be assigned are
those oligonucleotides which hybridize in the region of the intron of an existing
(genomic) DNA. The introns of a genomic DNA coding for β-tubulin, which can be
isolated from the abovementioned nematodes, are described by way of example for
Cylicocyclus nassatus in SEQ ID NO. 11. The introns are thus those sequences which
are located between the coding exons. In a preferred embodiment, the described
primers or hybridization probes contain a labeled group which makes possible the
detection of the oligonucleotides, for example radioisotopes, fluorescent groups,
enzymes or enzyme cofactors.

Such oligonucleotides can be employed in diagnostic test kits in order to determine
the origin (i.e. the organism) of an existing DNA. On account of their specific
hybridization with the DNAs mentioned under SEQ ID NO. 1, 3, 5, 7, 9 and 11, their
fragments, sequences homologous thereto and sequences complementary thereto, the
oligonucleotides are suitable for recognizing defined sequences. Thus they also make
possible the recognition and identification of those sequences coding for β-tubulin
which, on account of one or more point mutations in codon 200, lead to the
expression of benzimidazole-resistant β-tubulin. These oligonucleotides derived from
the sequences SEQ ID NO. 1, 3, 5, 7, 9 and 11, preferably the embodiments
described under SEQ ID NO. 12 to 51, are thus suitable for the identification of the
frequently occurring nematode species of the subfamily of the Cyathostominae, and
for the recognition of existing resistances to benzimidazoles.
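
The inspection of codon 200 can be sketched as follows for an already sequenced,
intron-free coding sequence that starts with the start codon. The codon pair used
here (TTC as the susceptible type, TAC as the resistance-associated type) is an
assumption taken from the general literature on benzimidazole resistance and is not
stated in this description; the function names and the toy sequence are likewise
hypothetical.

```python
# Assumption (general literature, not this document): TTC at codon 200 is the
# susceptible type and TAC the resistance-associated type.
SUSCEPTIBLE_CODON = "TTC"
RESISTANT_CODON = "TAC"


def codon_at(cds: str, codon_number: int) -> str:
    """Return codon `codon_number` (1-based) of an intron-free, in-frame coding sequence."""
    start = 3 * (codon_number - 1)
    codon = cds[start:start + 3].upper()
    if len(codon) != 3:
        raise ValueError("coding sequence too short for the requested codon")
    return codon


def classify_codon_200(cds: str) -> str:
    """Label the sequence by the codon found at position 200."""
    codon = codon_at(cds, 200)
    if codon == SUSCEPTIBLE_CODON:
        return "susceptible-type"
    if codon == RESISTANT_CODON:
        return "resistant-type"
    return "other: " + codon
```

A genomic sequence containing introns would first have to be reduced to its exons
before the codon numbering used here applies.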



The oligonucleotides according to the present invention can be prepared using
standard methods which are familiar to the person skilled in the art, e.g. by de novo
DNA synthesis.



The nucleic acids mentioned here can be present in complete cells, in cell lysates,
or in partially purified or biologically pure form, i.e. when other cell components
or, in the case of a chemical synthesis of the DNA, chemical precursors and
by-products have been separated off.



Nucleic acids coding for β-tubulin, as described above, can be obtained starting from
mRNA which is present in a number of eukaryotic cells. It is also possible to obtain
the DNA according to the invention starting from genomic DNA from the nematode
cells concerned (see also the following examples). A gene coding for β-tubulin can
be obtained, for example, from a cDNA library or a genomic DNA library. cDNA
can be obtained by isolating the total mRNA of a cell, e.g. of a nematode cell.
Starting from the mRNA, double-stranded cDNA can then be prepared and inserted
into a suitable plasmid or a suitable vector. The DNA according to the invention can
also be obtained by amplification with the aid of the known polymerase chain
reaction (PCR) or alternatively by de novo DNA synthesis (see also J. Sambrook et
al. (1989) Molecular Cloning, 2nd Edition, Chap. 14).

Vectors and plasmids 

The present invention also comprises expression vectors which contain one of the
nucleic acid sequences according to the invention, functionally linked to a
transcription-regulatory sequence. "Functionally linked" means that the nucleic acid
sequence is linked to the regulatory sequence in a manner such that the expression of
the protein encoded by the nucleic acid sequence can be controlled.
"Transcription-regulatory sequences" include, for example, promoters, enhancers and
other control elements. The expression vectors contain, for example, a gene coding
for a β-tubulin according to the invention or fragments thereof. These vectors can be
used for incorporation into cells, where the corresponding polypeptides or
alternatively fusion proteins are then formed. Suitable promoters for the expression
of the protein according to the invention in E. coli include natural, hybrid or
bacteriophage promoters. Preferably, they are promoters from the group of the phage λ
promoters, omp promoters or synthetic promoters (see also WO 98/15625, DE 3 430 683,
EP 0 173 149). Suitable vectors are commercially obtainable, e.g. the expression
vectors of the pET series (e.g. pET3a, pET23a, pET28a with His tag or pET32a with
His tag) or pGEX with glutathione S-transferase fusion. The expression vectors can
then be transformed, for example, into DE3-lysogenic E. coli strains, e.g. BL21(DE3),
HMS174(DE3) or AD494(DE3).

Expression of the β-tubulin polypeptides

The present invention also comprises cells which contain the nucleic acid sequences 
according to the invention (e.g. inserted into a vector or into the genome). These host 
cells can be prokaryotic or eukaryotic. 

Suitable prokaryotic expression systems are, for example, bacterial systems such as 
Streptomyces spp., Bacillus subtilis, Salmonella typhimurium, Serratia marcescens 
and preferably E. coli.

A preferred eukaryotic expression system is the baculovirus system, particularly 
preferably that which allows post-translational modifications. 

Other eukaryotic expression systems (e.g. yeast, insect cells) can likewise be used. 

Polypeptides 

The present invention likewise comprises β-tubulin polypeptides which are encoded
by the DNAs according to the invention, preferably the DNA sequences as set forth
in SEQ ID NO. 1, 3, 5, 7, 9 and 11, by fragments thereof or by homologous DNA
sequences as described above. Preferred embodiments of these β-tubulin
polypeptides are described in the sequences as set forth in SEQ ID NO. 2, 4, 6, 8 and
10. In a preferred embodiment, the described polypeptides are purified polypeptides
which are free of contaminating proteins of those cells in which the polypeptides
according to the invention have been produced.

The polypeptides described are proteins of full length or fragments, motifs or
domains thereof which comprise lengths of at least 5, 10, 25, 50, 75, 100, 125, 150,
200, 250, 300, 350 or 400 amino acids. Polypeptide fragments can be obtained and
selected by testing polypeptides which are encoded by nucleic acid
fragments derived from the sequences as set forth in SEQ ID NO. 1, 3, 5, 7, 9 and 11.

Polypeptide fragments can also be chemically synthesized in a known manner.

The invention also comprises polypeptides which are encoded by the degenerate
sequence as set forth in SEQ ID NO. 3. Owing to the various possible bases in
defined positions of the DNA sequence, various polypeptides result having an amino
acid encoded by the codon resulting in each case. The polypeptides encoded by DNA
as set forth in SEQ ID NO. 3 are described in SEQ ID NO. 4, the variable amino
acids being marked by "Xaa".
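
The relationship between a degenerate DNA sequence and the variable amino acids
marked "Xaa" can be illustrated with a short sketch. It covers only the degenerate
symbols r, y and w named in this description, uses the standard genetic code, and
writes a variable position as "X"; the toy sequence and the function names are
illustrative assumptions and do not reproduce SEQ ID NO. 3 or 4.

```python
from itertools import product

# Degenerate symbols used in this description: r = A/G, y = C/T, w = A/T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT", "W": "AT"}

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expand(degenerate: str) -> list[str]:
    """List every concrete sequence covered by a degenerate sequence."""
    choices = [IUPAC[symbol] for symbol in degenerate.upper()]
    return ["".join(combo) for combo in product(*choices)]


def consensus_protein(degenerate_cds: str) -> str:
    """Translate codon by codon; 'X' marks codons that can encode more than one amino acid."""
    usable = len(degenerate_cds) - len(degenerate_cds) % 3
    protein = []
    for i in range(0, usable, 3):
        residues = {CODON_TABLE[codon] for codon in expand(degenerate_cds[i:i + 3])}
        protein.append(residues.pop() if len(residues) == 1 else "X")
    return "".join(protein)
```

In the toy sequence ATGTTYTWC, the codon TTY covers TTC and TTT, which both encode
phenylalanine, so the position stays defined; the codon TWC covers TAC (tyrosine)
and TTC (phenylalanine), so the position becomes variable.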

Preferred embodiments of the polypeptides are described in SEQ ID NO. 2, 4, 6, 8 
and 10. 

The present invention comprises all processes for the preparation of the polypeptides 
according to the invention. 

It is known to the person skilled in the art that the polypeptides of the present 
invention can be obtained in various ways, e.g. by chemical methods such as the 
solid-phase method. For the obtainment of larger amounts of protein, the use of 
recombinant methods is recommended. 

The underlying steps for the preparation of the recombinant β-tubulin are:

1. Obtainment of a natural, synthetic or semisynthetic DNA which codes for the
β-tubulin according to the invention.

2. Incorporation of this DNA into an expression vector which is suitable for
expressing the β-tubulin according to the invention, either on its own or as a
fusion protein.

3. Transformation of a suitable, preferably prokaryotic, host cell using this
expression vector.

4. Growth of this transformed host cell in a manner which is suitable for
expressing the β-tubulin according to the invention.

5. Harvesting of the cells and purification of the β-tubulin by means of suitable,
known methods.

For example, the expression vectors can be transformed into λDE3-lysogenic E. coli
strains, e.g. BL21(DE3), HMS174(DE3) or AD494(DE3). After the growth of the
cells under the standard conditions familiar to the person skilled in the art, expression
is induced using IPTG. After induction of the cells, incubation is carried out at
temperatures of 18 to 37°C for 3 to 24 hours. The cells are disrupted and the expressed
protein is purified by means of chromatographic methods, in the case of protein
expressed with a His tag by means of FPLC on an Ni-NTA column, and also by
ion-exchange chromatography, gel filtration chromatography, ultrafiltration,
electrophoresis or alternatively immunoaffinity purification specific for
the polypeptides according to the invention.

Homologs or fragments of the polypeptides according to the invention can be
generated by mutagenesis, such as, for example, by directed (point) mutagenesis, or
by deletions.

The polypeptides according to the invention can also be chemically modified, e.g.
with glycosyl groups, lipids, phosphates, acetyl groups or similar groups. Covalent
derivatives can be obtained by the linkage of the modifying group with functional
groups of the amino acid side chains or the N terminus or C terminus of the
polypeptide.



In the expression of the polypeptides according to the present invention, it may be
advantageous to change certain codons in order to make possible optimum
expression. This is true if the use of certain codons ("codon usage") in the
heterologous expression system is different than in one of the organisms according to
the invention. Furthermore, the deletion of the 5'- or 3'-untranslated region is
possible, e.g. if several destabilizing sequence motifs (e.g. ATTTA) are present in the
3' region of the cDNA.
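
Whether a 3' region contains several such motifs can be checked with a minimal
sketch. The motif ATTTA is taken from the text; the threshold of two hits as a
reading of "several" and the function names are illustrative assumptions.

```python
def find_motif(sequence: str, motif: str = "ATTTA") -> list[int]:
    """Return the 0-based start positions of every (possibly overlapping) motif hit."""
    sequence, motif = sequence.upper(), motif.upper()
    hits = []
    pos = sequence.find(motif)
    while pos != -1:
        hits.append(pos)
        pos = sequence.find(motif, pos + 1)
    return hits


def has_several_destabilizing_motifs(utr_3prime: str, min_hits: int = 2) -> bool:
    """Assumption: 'several' is read as at least `min_hits` occurrences of ATTTA."""
    return len(find_motif(utr_3prime)) >= min_hits
```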



Fusion proteins 

The polypeptides according to the invention can also be present as part of a fusion
protein. Such fusion proteins are embraced in full by the present invention. Fusion
proteins can be useful under conditions where it is desirable to obtain an
immunogenic fragment of the β-tubulin (see, for example, EP 0 259 149; Schlienger
et al. (1992) J. Virol. 66, 2). Under certain circumstances, fusion proteins facilitate
the expression of a polypeptide. For example, the polypeptides according to the
invention can be prepared as glutathione S-transferase (GST) fusion proteins. Such
GST fusion proteins facilitate easy purification of the polypeptide (see, for example,
Current Protocols in Molecular Biology, eds. Ausubel et al. (John Wiley & Sons,
N.Y. 1991)). Fusion proteins can contain, for example, a "leader" sequence which is
used for the purification, e.g. a poly-His sequence at the N terminus (but also
at the C terminus) of the protein, whose purification [lacuna] by means of
chromatography on an Ni2+-NTA column (see, for example, Hochuli et al. (1987) J.
Chromatography 411, 177).

Techniques for preparing such fusion proteins are familiar to the person skilled in the
art.

Antibodies

Another aspect of the present invention relates to antibodies which react specifically
with the β-tubulin polypeptides according to the invention.

For example, anti-protein or anti-peptide antisera or monoclonal antibodies can be
prepared according to standard protocols by the use of immunogens which have been
derived from β-tubulin polypeptides according to the invention (see, for example,
Antibodies: A Laboratory Manual, ed. by Harlow and Lane (Cold Spring Harbor
Press, 1988)).

Mammals such as mice, hamsters or rabbits can be immunized using an
immunogenic form or an immunogenic fraction of the polypeptide according to the
invention, e.g. using a polypeptide which is able to produce an antibody response
(see also "fusion proteins" above). The appropriate techniques are familiar to the
person skilled in the art. Thus an immunogenic fraction of the β-tubulin can be
administered in the presence of an adjuvant. The course of the immunization can be
observed by checking the antibody titer in the plasma or serum, e.g. by customary
ELISA assays or other immunoassays.

In a preferred embodiment, the antibodies according to the invention are
immunospecific for an antigenic determinant of a β-tubulin polypeptide according to
the invention, e.g. of a polypeptide as set forth in SEQ ID NO. 2, 4, 6, 8 or 10 or
those polypeptides which are encoded by DNAs as set forth in SEQ ID NO. 1, 3, 5, 7,
9 or 11 or sequences identical to 85% therewith, preferably sequences identical to
95%.

After the immunization of a mammal, polyclonal anti-β-tubulin antibodies can be
isolated from the serum. For the production of monoclonal antibodies,
antibody-producing cells (lymphocytes) can be obtained from an immunized animal and
fused according to known methods with immortal cells such as myeloma cells in order to
obtain hybridoma cells (see, for example, Kohler and Milstein (1975) Nature 256,
495-497; Kozbor et al. (1983) Immunology Today 4, 72; Cole et al. (1985)
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. pp. 77-96).

The "antibodies" mentioned here should also include fragments of antibodies which
react specifically with β-tubulin according to the invention. Antibodies can be
fragmented using conventional techniques and the fragments tested.



A preferred embodiment relates to antibodies such as described above, which carry a
detectable label (e.g. radioisotopes, fluorescent groups, enzymes or enzyme
cofactors).

Antibodies which bind specifically to β-tubulin polypeptides according to the
invention can also be used for the immunohistochemical staining of tissue samples in
order to detect the expression of a specific β-tubulin. The anti-β-tubulin antibodies
can likewise be used for diagnostic purposes, e.g. for immunoprecipitation or for
immunoblotting.



Diagnostic test procedures



The present invention likewise makes available nucleic acid molecules which can be 
used for diagnostic purposes. 

These include nucleic acid molecules such as described above, the fragments of the
DNA sequences described under SEQ ID NO. 1, 3, 5, 7, 9 or 11 or DNA sequences
complementary thereto. For example, oligonucleotides as set forth in SEQ ID NO. 12
to 51 are made available, which are able to hybridize to sense or antisense sequences
coding for β-tubulin, and to intron sequence sections which are described by way of
example in SEQ ID NO. 11.

In this procedure, the nucleic acid of a cell is rendered accessible to hybridization, the
DNA sample is brought into contact with the oligonucleotides, and the hybridization
of the sample with the oligonucleotide is detected.

In this manner, a method is made available which makes it possible to differentiate
between various species of small strongylids and/or other nematode species by means
of the specific hybridization of the oligonucleotides according to the invention to a
DNA sample, preferably with the aid of those oligonucleotides which hybridize to the
intron regions of the DNA coding for β-tubulin. In a particularly preferred
embodiment, the oligonucleotides according to the invention allow the identification
of resistances in small strongylids (Cyathostominae), e.g. in horses, especially
resistances of the species Cylicocyclus nassatus, Cyathostomum coronatum and
Cyathostomum catinatum. In this procedure, use can be made of the fact that resistant
forms of the β-tubulin carry at least one point mutation in the DNA according to the
invention coding therefor, which can be detected by means of PCR, such as described,
for example, in an analogous manner in Elard et al. (1998) PCR diagnosis of
benzimidazole-susceptibility or -resistance in natural populations of the small
ruminant parasite, Teladorsagia circumcincta. Veterinary Parasitology 80, 231-237.

The method described is particularly helpful for the assessment of possible treatment
strategies in humans and animals infected with nematodes, such as, for example,
horses, sheep, pigs, goats, camels, buffalo, donkeys, hares, roe deer, fur-bearing
animals, birds (e.g. chickens, turkeys, ducks) and fresh- and salt-water fish (e.g.
trout, carp). It makes possible the identification and differentiation of the parasitic
nematodes and the recognition of resistant populations thereof, and thus avoids a
treatment with inactive nematicides.

The methods described here can be made available, for example, in the form of
prefabricated diagnostic test kits which contain at least one of the abovementioned
nucleic acid molecules or an antibody such as described above, which is prepared in
ready-to-use form.

Procedure for the discovery of nematicidal substances

The invention relates to a procedure in which, with the aid of tubulin or fragments
thereof, novel, specific anthelmintic substances can be identified.

In a preferred embodiment, β-tubulin polypeptides according to the present invention
are used for this. However, the procedure can also be carried out using tubulin from
species other than those mentioned here. Procedures which use β-tubulin
polypeptides other than those according to the invention are included in full by the
present invention. Particularly preferably, β-tubulin polypeptides as set forth in SEQ
ID NO. 2, 4, 6, 8 or 10 are used for the procedure mentioned. In the present
invention, recombinant β-tubulin polypeptides from frequently occurring parasitic
nematodes are thus additionally made available. These can be used in various test
systems for the identification of new inhibitors of tubulin interaction, or the
interaction of tubulin subunits.

Cell-free test systems 

Many test systems which have the testing of compounds and natural extracts as their
aim are designed for high throughput in order to maximize the number of
substances investigated in a given period of time. Test systems which are based on
cell-free studies use purified or semi-purified protein. They are suitable for a "first"
test, which primarily aims at detecting a possible influence of a substance on the
target protein.

Effects such as cell toxicity are as a rule ignored in these in vitro systems. The test
systems in this case check both inhibitory or suppressive effects of the substances
and stimulatory effects. The effectiveness of a substance can be checked by means of
concentration-dependent test series. Control batches without test substances can be
used for the assessment of the effects.

One possibility for the identification of substances which modulate the interaction of
tubulin or of its subunits is the "scintillation proximity assay" (SPA), see EP 015
473. This test system uses the interaction of a receptor (e.g. tubulin) with a
radiolabeled ligand (e.g. a small organic molecule or a second, radiolabeled protein
molecule). The receptor is in this case bound to small spheres ("microspheres") or
beads which are provided with scintillating molecules. In the course of the decay of
the radioactivity, the scintillating substance in the microsphere is excited by the
subatomic particles of the radioactive label and a detectable photon is emitted. The
test conditions are optimized such that only those particles lead to a signal which
emanate from a ligand bound to the receptor or the tubulin.

In one possible embodiment, tubulin is bound to the beads, either together with or without
interacting or binding test substances. α- or β-tubulin subunits could be used here. A
radiolabeled ligand could be, for example, a labeled benzimidazole or a further,
labeled β-tubulin molecule. For the ligand to bind to the immobilized tubulin in the
region of the contact surface, it must inhibit or abolish an existing interaction between
immobilized and free tubulin. Binding to the immobilized tubulin which has taken
place can then be detected by means of a flash of light. Conversely, an existing complex between
an immobilized and a free, labeled tubulin is destroyed by the binding of a test
substance, which leads to a decrease in the detected intensity of the flash of light. The
test system then corresponds to a complementary inhibition system.
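
As an illustration of how the readout of such an inhibition assay might be evaluated in a concentration-dependent test series, the following minimal Python sketch converts scintillation counts into percent inhibition relative to a control batch without test substance and interpolates the concentration at which half of the signal is lost. All counts, concentrations and function names are hypothetical and chosen only for illustration; the description does not prescribe a particular evaluation method.

```python
import math


def percent_inhibition(signal, control, background=0.0):
    """Signal loss in percent, relative to the control batch without test substance."""
    return 100.0 * (1.0 - (signal - background) / (control - background))


def estimate_ic50(concentrations, inhibitions):
    """Log-linear interpolation of the concentration giving 50 % inhibition.

    Returns None if the test series never crosses 50 %.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo < 50.0 <= i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical readout in counts per minute (cpm); not data from the source.
control_cpm = 10000.0     # labeled tubulin complex, no test substance
background_cpm = 500.0    # beads without bound labeled ligand
series_cpm = {0.1: 9800.0, 1.0: 8100.0, 10.0: 5250.0, 100.0: 1450.0}  # µM -> cpm

inhibition = {
    conc: percent_inhibition(cpm, control_cpm, background_cpm)
    for conc, cpm in series_cpm.items()
}
ic50 = estimate_ic50(list(inhibition), list(inhibition.values()))
```

With real data a fitted dose-response curve would normally replace the simple interpolation; the sketch only shows how the control batch enters the calculation.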

Test system based on cells

The β-tubulin available by means of the present invention, but also tubulin from
other species, makes possible the development of cell-based test systems
for the identification of substances which inhibit the tubulin interaction.

An example of such a test system is the "two hybrid system". A specific example of
this is the "interaction trap", a genetic selection for interacting proteins in yeast
(see, for example, Gyuris et al. (1993) Cdi1, a human G1 and S phase protein
phosphatase that associates with Cdk2. Cell 75, 791-803). The test system is
designed to detect and to describe the interaction of two proteins in that an
interaction which has taken place leads to a detectable signal.



Such a test system can also be adapted to the testing of large numbers of test 
substances in a given period of time. 


The system is based on the construction of two vectors, the "bait" vector and the
"prey" vector. A gene coding for tubulin, preferably a gene coding for a β-tubulin
according to the invention, is cloned in the bait vector and then expressed as a fusion
protein with the LexA protein, a DNA-binding protein. A second gene coding for
tubulin, preferably for a β-tubulin according to the invention, is cloned in the prey
vector, where it is expressed as a fusion protein with the B42 prey protein. Both
vectors are present in a Saccharomyces cerevisiae host, which contains copies of a
LexA-binding DNA on the 5' side of a lacZ or HIS3 reporter gene. If an interaction
takes place between the two tubulin (fusion) proteins, transcription of the reporter
gene is activated. If the presence of a test substance leads to the inhibition
or disturbance of the tubulin interaction, the two tubulin (fusion) proteins can no
longer interact and the product of the reporter gene is no longer produced.
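
The logic of this readout can be summarized in a few lines of Python. The sketch below is only a schematic model under simplifying assumptions: the class, the function names and the 50 % threshold are hypothetical, and the description does not specify how reporter signals are to be scored.

```python
from dataclasses import dataclass


@dataclass
class YeastCell:
    has_bait: bool             # LexA-tubulin fusion is expressed
    has_prey: bool             # B42-tubulin fusion is expressed
    interaction_blocked: bool  # a test substance disturbs the tubulin contact


def reporter_transcribed(cell: YeastCell) -> bool:
    """The reporter (lacZ or HIS3) is on only if bait and prey can interact."""
    return cell.has_bait and cell.has_prey and not cell.interaction_blocked


def is_candidate(signal_with: float, signal_without: float,
                 max_fraction: float = 0.5) -> bool:
    """Flag a substance whose reporter signal falls below a chosen fraction
    of the control without test substance (the threshold is an assumption)."""
    if signal_without <= 0:
        raise ValueError("control signal must be positive")
    return signal_with / signal_without < max_fraction
```

A reduced reporter signal alone does not prove a specific effect on the tubulin interaction, since a substance that is toxic to the yeast host would lower the signal as well.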



With the aid of tubulin, particularly of the β-tubulin according to the invention or
fragments thereof, and the processes described above, it is possible to identify new
and specific antiparasitic compounds.

Compounds which are found with the aid of the processes and polypeptides described
are valuable for the treatment of humans and animals which are infected with
pathogenic endoparasites of humans or of agricultural animals, pets, zoo animals,
and laboratory and experimental animals.

The compounds are active against all stages of development of normal, sensitive
strains and also resistant strains. By treatment with agents which contain one or more
of these compounds, both economic losses in the case of agricultural animals and
diseases in humans and animals can be avoided or treated. The following parasites
are of particular interest as targets of the active compounds found:

Enoplida, e.g. Trichuris spp., Capillaria spp., Trichomosoides spp., Trichinella spp.

Rhabditia, e.g. Micronema spp., Strongyloides spp.

Strongylida, e.g. Strongylus spp., Triodontophorus spp., Oesophagodontus spp.,
Trichonema spp., Gyalocephalus spp., Cylindropharynx spp., Poteriostomum spp.,
Cyclococercus spp., Cylicostephanus spp., Oesophagostomum spp., Chabertia spp.,
Stephanurus spp., Ancylostoma spp., Uncinaria spp., Bunostomum spp., Globocephalus
spp., Syngamus spp., Cyathostomum spp., Cylicocyclus spp., Neostrongylus
spp., Cystocaulus spp., Pneumostrongylus spp., Spicocaulus spp., Elaphostrongylus
spp., Parelaphostrongylus spp., Crenosoma spp., Paracrenosoma spp., Angiostrongylus
spp., Aelurostrongylus spp., Filaroides spp., Parafilaroides spp., Trichostrongylus
spp., Haemonchus spp., Ostertagia spp., Marshallagia spp., Cooperia
spp., Nematodirus spp., Hyostrongylus spp., Obeliscoides spp., Amidostomum spp.,
Ollulanus spp.

Oxyurida, e.g. Oxyuris spp., Enterobius spp., Passalurus spp., Syphacia spp.,
Aspiculuris spp., Heterakis spp.

Ascaridia, e.g. Ascaris spp., Toxascaris spp., Toxocara spp., Parascaris spp., Anisakis
spp., Ascaridia spp.

Spirurida, e.g. Gnathostoma spp., Physaloptera spp., Thelazia spp., Gongylonema
spp., Habronema spp., Parabronema spp., Draschia spp., Dracunculus spp.

Filariida, e.g. Stephanofilaria spp., Parafilaria spp., Setaria spp., Loa spp.,
Dirofilaria spp., Litomosoides spp., Brugia spp., Wuchereria spp., Onchocerca spp.

Gigantorhynchida, e.g. Filicollis spp., Moniliformis spp., Macracanthorhynchus spp.,
Prosthenorchis spp.

Mastigophora (Flagellata)

Trypanosomatidae, e.g. Trypanosoma b. brucei, T. b. gambiense, T. b. rhodesiense,
T. congolense, T. cruzi, T. evansi, T. equinum, T. lewisi, T. percae, T. simiae, T.
vivax, Leishmania brasiliensis, L. donovani, L. tropica

Trichomonadidae, e.g. Giardia lamblia, G. canis.

Sarcomastigophora (Rhizopoda), e.g. Entamoeba histolytica

Hartmanellidae, e.g. Acanthamoeba sp., Hartmanella spp.

Apicomplexa (Sporozoa), e.g. Eimeria acervulina, E. adenoides, E. alabahmensis, E.
anatis, E. anseris, E. arloingi, E. ashata, E. auburnensis, E. bovis, E. brunetti, E.
canis, E. chinchillae, E. clupearum, E. columbae, E. contorta, E. crandalis, E.
debliecki, E. dispersa, E. ellipsoidales, E. falciformis, E. faurei, E. labbeana, E.
leucarti, E. magna, E. maxima, E. media, E. meleagridis, E. meleagrimitis, E. mitis,
E. necatrix, E. ninakohlyakimovae, E. ovis, E. parva, E. pavonis, E. perforans, E.
phasani, E. piriformis, E. praecox, E. residua, E. scabra, E. spec., E. stiedai, E. suis,
E. tenella, E. truncata, E. truttae, E. zuernii, Globidium spec., Isospora belli, I.
canis, I. felis, I. ohioensis, I. rivolta, I. spec., I. suis, Neospora caninum,
Cystisospora spec., Cryptosporidium spec.

Toxoplasmadidae, e.g. Toxoplasma gondii

Sarcocystidae, e.g. Sarcocystis bovicanis, S. bovihominis, S. neurona, S. ovicanis, S.
ovifelis, S. spec., S. suihominis

Leucozoidae, e.g. Leucozytozoon simondi

Plasmodiidae, e.g. Plasmodium berghei, P. falciparum, P. malariae, P. ovale, P.
vivax, P. spec.

Piroplasmea, e.g. Babesia argentina, B. bovis, B. canis, B. spec., Theileria parva, T.
spec.

Adeleina, e.g. Hepatozoon canis, H. spec.

The following are furthermore of importance:

Myxospora and Microspora, e.g. Glugea spec. and Nosema spec., and Pneumocystis
carinii, Ciliophora (Ciliata), e.g. Balantidium coli, Ichthiophthirius spec.,
Trichodina spec. or Epistylis spec.

The compounds and agents found are likewise effective against protozoa of insects,
such as those of the phylum Microsporidia, particularly those of the genus Nosema,
very particularly those of the species Nosema apis, which are parasites of honeybees.

Examples

Example 1 

Obtaining β-tubulin cDNA and genomic DNA

mRNA was obtained from C. nassatus worms with the aid of the Quick Prep® Micro
mRNA kit (Pharmacia Biotech, Freiburg, Germany) and from C. coronatum and
C. catinatum worms using the Dynal® mRNA Direct Kit (Dynal, Hamburg,
Germany). The worms were isolated from the large intestine of horses and
differentiated microscopically according to the characteristic structure of head and
tail (see J. R. Lichtenfels (1975), Helminths of domestic equids. Proceedings of the
Helminthological Society of Washington, 42 (Special issue), 1-92).



The synthesis of the cDNA was carried out with the aid of the "reverse transcription
system" (Promega, Madison, USA) in the case of the mRNA from C. nassatus and
using SuperScript II reverse transcriptase (Gibco BRL Life Technologies) in
the case of the mRNA from C. coronatum and C. catinatum. In both cases,
oligonucleotides having a length of 15 base pairs were used. Incubation was carried
out for one hour at 42°C.



Genomic DNA was obtained from 4 to 40 adult worms using the QIAamp Tissue Kit
(Qiagen, Hilden, Germany). In this procedure, the worms were digested for 2 hours at
55°C with proteinase K and the genomic DNA was extracted using "spin columns".









Example 2 

Amplification of β-tubulin sequences

The amplification of β-tubulin sequences of full length or of fragments can be carried
out, for example, using AmpliTaq Gold™ polymerase (Perkin Elmer, Foster City,
California, USA).

An amplification of the β-tubulin sequences as set forth in SEQ ID NO. 1, 3, 5, 7, 9
or 11 or fragments thereof can be carried out with the aid of the primers as set forth
in SEQ ID NO. 12-51.

For the amplification of the cDNA from C. coronatum, for example, the sequences as
set forth in SEQ ID NO. 43 and 44 are suitable; for the amplification of the cDNA
from C. catinatum, the sequences as set forth in SEQ ID NO. 40 and 42 are
particularly suitable.

The amplification of the C. nassatus cDNA and of the genomic DNA of all species
according to the present invention was carried out in a total volume of 50 µl,
containing 5 µl of 10 x buffer, 2.5 µl of MgCl2 (25 mM), 2 µl of dNTP mix (2 mM
for each NTP), 1 µl of each specific primer (SEQ ID NO. 12-47) (50 pmol/µl), 0.5
µl (2.5 U) of polymerase and 1-5 µl of DNA template. When using degenerate
primers (SEQ ID NO. 48-51), 1 µl of each primer at a concentration of 500 pmol/µl was
employed. Annealing was carried out at 46°C in the case of degenerate primers;
in the case of specific primers the temperature was varied according to the calculated
melting temperature. The PCR cycles were chosen as follows:

95°C for 10 min, then 35-40 cycles with 1 min denaturation at 94°C, 1 min
annealing, 1 min at 72°C and a final step at 72°C for 10 min. In the amplification of
cDNA from C. coronatum and C. catinatum, a "touchdown" PCR temperature
program was carried out, which has the following profile:

firstly 15 cycles at 94°C for 30 sec, then 1 min at 60°C and 1 min at 72°C, followed
by 15 cycles with 30 sec at 95°C, 55°C for 1 min and 72°C for 1 min and finally 10
cycles at 95°C for 30 sec, then 45°C for 1 min and 72°C for 1 min. For the
amplification of larger fragments (>1000 base pairs), the elongation phase at 72°C
was lengthened to 2.30 min.
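
For orientation, the reaction set-up and the touchdown profile given in this example can be restated as a short Python sketch that derives the final concentrations in the 50 µl reaction and the total programmed hold time. The helper names are hypothetical. Two points are assumptions rather than statements of the example: that the remaining volume is made up with water, and that each reaction contains one primer pair. The MgCl2 figure covers only the MgCl2 that is added separately, because the composition of the 10 x buffer is not given.

```python
TOTAL_UL = 50.0  # total reaction volume stated in Example 2


def final_conc(stock: float, volume_ul: float, total_ul: float = TOTAL_UL) -> float:
    """Dilution rule c_final = c_stock * v_added / v_total (unit of the stock)."""
    return stock * volume_ul / total_ul


mgcl2_mM = final_conc(25.0, 2.5)               # separately added MgCl2 only
dntp_each_mM = final_conc(2.0, 2.0)            # per nucleotide
specific_primer_uM = final_conc(50.0, 1.0)     # pmol/µl is numerically µM
degenerate_primer_uM = final_conc(500.0, 1.0)


def water_ul(template_ul: float, n_primers: int = 2) -> float:
    """Water needed to reach 50 µl, assuming one primer pair per reaction."""
    fixed = 5.0 + 2.5 + 2.0 + 0.5  # buffer, MgCl2, dNTP mix, polymerase
    return TOTAL_UL - fixed - n_primers * 1.0 - template_ul


# Touchdown profile as stated: (cycles, [(temperature in °C, seconds), ...])
TOUCHDOWN = [
    (15, [(94, 30), (60, 60), (72, 60)]),
    (15, [(95, 30), (55, 60), (72, 60)]),
    (10, [(95, 30), (45, 60), (72, 60)]),
]


def hold_minutes(program) -> float:
    """Sum of the programmed hold times in minutes, ignoring ramp times."""
    return sum(n * sum(sec for _, sec in steps) for n, steps in program) / 60.0
```

The profile is kept as a plain list so that the lengthened elongation step for fragments above 1000 base pairs could be modelled by changing the 72°C entries.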

Example 3 

PCR products from the amplification of cDNA or genomic DNA from C. nassatus,
C. coronatum and C. catinatum were cloned with the aid of the "Original TA
Cloning Kit" (Invitrogen, Leek, Netherlands), namely in the "Original TA Cloning®"
vector.

Patent claims

1. A DNA coding for β-tubulin from Cyathostominae or fragments thereof.

2. A DNA as claimed in claim 1, comprising 

a) a polynucleotide having at least 85% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 2; 

b) a polynucleotide having at least 85% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 4; 

c) a polynucleotide having at least 85% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 6; 

d) a polynucleotide having at least 85% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 8; 

e) a polynucleotide having at least 85% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 10. 

3. A DNA as claimed in claim 1, comprising 

a) a polynucleotide having at least 95% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 2; 

b) a polynucleotide having at least 95% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 4; 

c) a polynucleotide having at least 95% identity to a polynucleotide
coding for an amino acid sequence as set forth in SEQ ID NO. 6;

d) a polynucleotide having at least 95% identity to a polynucleotide
coding for an amino acid sequence as set forth in SEQ ID NO. 8;

e) a polynucleotide having at least 95% identity to a polynucleotide 
coding for an amino acid sequence as set forth in SEQ ID NO. 10. 

4. A DNA as claimed in one of claims 1 to 3, comprising a sequence as set forth 
in SEQ ID NO. 1. 

5. A DNA as claimed in one of claims 1 to 3, comprising a sequence as set forth 
in SEQ ID NO. 3. 

6. A DNA as claimed in one of claims 1 to 3, comprising a sequence as set forth 
in SEQ ID NO. 5. 

7. A DNA as claimed in one of claims 1 to 3, comprising a sequence as set forth 
in SEQ ID NO. 7. 

8. A DNA as claimed in one of claims 1 to 3, comprising a sequence as set forth 
in SEQ ID NO. 9. 

9. A DNA as claimed in one of claims 1 to 3, comprising a sequence as set forth 
in SEQ ID NO. 11. 

10. A DNA as claimed in one of claims 1 to 3 and 5 to 9, characterized in that it
originates from Cylicocyclus.

11. A DNA as claimed in one of claims 1 to 4, characterized in that it originates
from Cyathostomum.

12. A DNA as claimed in one of claims 1 to 3 and 5 to 10, characterized in that it
originates from Cylicocyclus nassatus.

13. A DNA as claimed in one of claims 1 to 4 and 11, characterized in that it
originates from Cyathostomum coronatum.

14. A DNA as claimed in one of claims 1 to 13, characterized in that it contains at
least one base replacement in codon 200, which leads to the expression of a
polypeptide having anthelmintic resistance.

15. A DNA, characterized in that it is complementary to DNA as claimed in one
of claims 1 to 14 or fragments thereof.

16. An RNA, characterized in that it is complementary to DNA as claimed in one
of claims 1 to 15.

17. An expression construct, characterized in that it comprises DNA as claimed in
one of claims 1 to 14 and a sequence linked functionally therewith, which
makes possible the expression of the DNA.

18. A vector, characterized in that it comprises DNA as claimed in one of claims
1 to 14.

19. A host cell, comprising DNA as claimed in one of claims 1 to 14, an
expression construct as claimed in claim 17, or a vector as claimed in claim
18.

20. A polypeptide encoded by a DNA as claimed in one of claims 1 to 14 or
fragments thereof.

21. A polypeptide as claimed in claim 20, consisting of or comprising an amino
acid sequence as set forth in SEQ ID NO. 2.

22. A polypeptide as claimed in claim 20, consisting of or comprising an amino 
acid sequence as set forth in SEQ ID NO. 4. 

23. A polypeptide as claimed in claim 20, consisting of or comprising an amino 
acid sequence as set forth in SEQ ID NO. 6. 

24. A polypeptide as claimed in claim 20, consisting of or comprising an amino 
acid sequence as set forth in SEQ ID NO. 8. 

25. A polypeptide as claimed in claim 20, consisting of or comprising an amino 
acid sequence as set forth in SEQ ID NO. 10. 

26. A polypeptide encoded by a DNA as claimed in claim 14. 

27. A process for the preparation of a polypeptide as claimed in one of claims 20 
to 26, comprising the expression of the polypeptide or fragments thereof in a 
prokaryotic or eukaryotic expression system. 

28. The use of DNA oligonucleotides which hybridize specifically to DNA as
claimed in one of claims 1 to 15, preferably to noncoding DNA sections, for
the detection of DNA which originates from Cyathostominae.

29. The use of DNA which hybridizes specifically to DNA as claimed in one of
claims 1 to 15 for the detection of DNA which originates from
Cyathostominae and codes for a polypeptide as claimed in claim 26.

30. A procedure for the detection of Cyathostominae, characterized in that DNA
as set forth in claim 28 is hybridized to DNA as claimed in one of claims 1 to
15 and this is amplified by means of PCR.

31. A procedure for the detection of Cyathostominae having anthelmintic
resistance, characterized in that DNA as set forth in claim 29 is hybridized to
DNA as claimed in one of claims 1 to 15 and this is amplified by means of
PCR.

32. A DNA oligonucleotide comprising one of the sequences as set forth in SEQ
ID NO. 12 to SEQ ID NO. 51 or a sequence derived from one of the DNA
sequences as claimed in claims 1 to 15.

33. A diagnostic test kit comprising at least one of the oligonucleotides as
claimed in claim 32 and/or antibodies as claimed in claim 35 or 36.

34. A diagnostic test kit as claimed in claim 33, characterized in that the DNA
oligonucleotides are provided with a detectable label.

35. An antibody, characterized in that it reacts specifically with an epitope of a
polypeptide as claimed in one of claims 20 to 26.

36. An antibody as claimed in claim 35, characterized in that it is monoclonal.

37. The use of antibodies as claimed in claim 35 or 36 as nematicides.

38. The use of polypeptides as claimed in one of claims 20 to 26 for the
production of vaccines.

39. A procedure for the identification of substances which modulate the
interaction of tubulin.

40. The procedure as claimed in claim 39, characterized in that

a) the test substance is brought into contact with tubulin under those
conditions which allow interaction of the tubulin molecules with one
another and binding of the test substance to tubulin,

b) the binding of the test substance which has taken place is detected by
determining the ability of the tubulin protein molecules to interact
with one another and

c) the ability of the tubulin protein molecules to interact with one another
in the presence of a test substance is compared with their ability to
interact with one another in the absence of a test substance.

41. The procedure as claimed in claim 39 or 40, characterized in that the tubulin
used is a polypeptide as claimed in one of claims 20 to 26.

42. The procedure as claimed in one of claims 39 to 41, characterized in that, for
the detection of a modulation of the tubulin interaction in the presence of a
test substance, a test system based on cells is used.

43. The procedure as claimed in one of claims 39 to 41, characterized in that, for
the detection of a modulation of the tubulin interaction in the presence of a
test substance, a cell-free test system is used.

44. A substance which is identified in a procedure as claimed in one of claims 39
to 43.

45. The use of a substance as claimed in claim 44 for the production of an agent
for the prophylactic or therapeutic treatment of nematode attack.

DNA coding for β-tubulin and its use

Abstract

The invention relates to DNA which codes for β-tubulin from nematodes of the
family of the Strongylidae, the polypeptide encoded by this DNA, the use of the DNA
for the diagnosis of anthelmintic resistance in the nematodes concerned and for the
identification of the species of parasitic nematodes, the use of the β-tubulin as a
constituent of a vaccine, and a process for the identification of new anthelmintic or
antibiotic compounds.
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SEQUENCE LISTING 
<110> Bayer AG 

<120> DNA coding for β-tubulin and its use

<130> Le A 33 759 

<140> 
<141> 

<160> 51 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 1380 
<212> DNA 

<213> Cyathostomum coronatum 

<220> 

<221> CDS 

<222> (1)..(1344)

<400> 1 

atg cgt gag atc gtg cat gta caa gct gga caa tgt gga aac caa att 48
Met Arg Glu Ile Val His Val Gln Ala Gly Gln Cys Gly Asn Gln Ile
1 5 10 15

ggt tcc aag ttt tgg gaa gtg atc tct gac gag cat ggc att aag ccc 96
Gly Ser Lys Phe Trp Glu Val Ile Ser Asp Glu His Gly Ile Lys Pro
20 25 30

gat ggc aca tac cac gga gaa tct gat cta caa tta gaa cga atc aat 144
Asp Gly Thr Tyr His Gly Glu Ser Asp Leu Gln Leu Glu Arg Ile Asn
35 40 45

gtg tac tat aat gaa gca cat gga ggc aaa tat gtc cca cgt gca gtt 192
Val Tyr Tyr Asn Glu Ala His Gly Gly Lys Tyr Val Pro Arg Ala Val
50 55 60

ctt gtt gat ctc gag ccc gga act atg gat tcc gtc cgt tcc ggg cca 240
Leu Val Asp Leu Glu Pro Gly Thr Met Asp Ser Val Arg Ser Gly Pro
65 70 75 80

tac ggg caa ttg ttc cgc cct gat aac tac gtg ttt gga cag tct ggc 288
Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr Val Phe Gly Gln Ser Gly
85 90 95

gca gga aat aac tgg gca aaa ggt cac tac act gaa ggc gct gaa ctt 336
Ala Gly Asn Asn Trp Ala Lys Gly His Tyr Thr Glu Gly Ala Glu Leu
100 105 110

gtc gac aat gta cta gat gta gtg cga aaa gaa gca gaa gga tgt gac 384
Val Asp Asn Val Leu Asp Val Val Arg Lys Glu Ala Glu Gly Cys Asp
115 120 125

tgt ctg cag ggc ttc cag cta act cac tca ctt gga gga ggt acc ggt 432
Cys Leu Gln Gly Phe Gln Leu Thr His Ser Leu Gly Gly Gly Thr Gly
130 135 140

tcg ggt atg ggc act ctc ctc atc tcc aaa att cgg gag gag tat cct 480
Ser Gly Met Gly Thr Leu Leu Ile Ser Lys Ile Arg Glu Glu Tyr Pro
145 150 155 160

gat aga atc atg tcc tcg ttc tcc gtt gtc ccc tca cca aag gtc tcc 528
Asp Arg Ile Met Ser Ser Phe Ser Val Val Pro Ser Pro Lys Val Ser
165 170 175

gac act gtt gtg gag cct tac aat gct acc cta tcc gtt cat cag ttg 576
Asp Thr Val Val Glu Pro Tyr Asn Ala Thr Leu Ser Val His Gln Leu
180 185 190

gtt gaa aat aca gac gag act tat tgt att gac aat gaa gcc ctg tat 624
Val Glu Asn Thr Asp Glu Thr Tyr Cys Ile Asp Asn Glu Ala Leu Tyr
195 200 205

gat att tgc ttc cgc act ttg aaa ctc acg aac cca act tat gga gat 672
Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr Asn Pro Thr Tyr Gly Asp
210 215 220

ctg aat cat ctt gtg tct gta aca atg tct ggt gtc acc aca tgt ctt 720
Leu Asn His Leu Val Ser Val Thr Met Ser Gly Val Thr Thr Cys Leu
225 230 235 240

cgc ttc cct ggc caa ttg aat gcc gat cta cgc aaa cta gct gtt aac 768
Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu Arg Lys Leu Ala Val Asn
245 250 255

atg gtt cca ttc cct cgt ctt cac ttc ttc atg cct ggt ttt gct cct 816
Met Val Pro Phe Pro Arg Leu His Phe Phe Met Pro Gly Phe Ala Pro
260 265 270

ctt tct gct aaa ggt gct cag gct tac cgt gct ctt acc gta gcc gag 864
Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg Ala Leu Thr Val Ala Glu
275 280 285

ctt aca cag cag atg ttt gat gct aag aat atg atg gct gct tgc gac 912
Leu Thr Gln Gln Met Phe Asp Ala Lys Asn Met Met Ala Ala Cys Asp
290 295 300

cct cga cat gga cgt tat ctc acc gtc gca gcc atg ttc cga gga aga 960
Pro Arg His Gly Arg Tyr Leu Thr Val Ala Ala Met Phe Arg Gly Arg
305 310 315 320

atg agc atg agg gaa gta gac gac cag atg atg tca gtg cag aac aag 1008
Met Ser Met Arg Glu Val Asp Asp Gln Met Met Ser Val Gln Asn Lys
325 330 335

aac tcc tca tac ttc gta gag tgg atc ccg aac aac gtg aag acc gct 1056
Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro Asn Asn Val Lys Thr Ala
340 345 350

gta tgc gac atc ccg cca cga gga ctg aag atg gcc gct acc ttc gtt 1104
Val Cys Asp Ile Pro Pro Arg Gly Leu Lys Met Ala Ala Thr Phe Val
355 360 365

gga aac tca act gcc atc caa gag ctg ttc aag cgc att tca gaa caa 1152
Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe Lys Arg Ile Ser Glu Gln
370 375 380

ttt aca gcc atg ttc cgc cgc aaa gcg ttc ttg cat tgg tac act ggt 1200
Phe Thr Ala Met Phe Arg Arg Lys Ala Phe Leu His Trp Tyr Thr Gly
385 390 395 400

gaa ggt atg gac gag atg gag ttc act gaa gca gag tcc aac atg aat 1248
Glu Gly Met Asp Glu Met Glu Phe Thr Glu Ala Glu Ser Asn Met Asn
405 410 415

gat ctc atc tcc gag tac caa cag tac cag gaa gcc acc gct gac gac 1296
Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln Glu Ala Thr Ala Asp Asp
420 425 430

atg ggc gat ctt gat gcg gaa ggc gct gaa gag gct tat cct gag gaa 1344
Met Gly Asp Leu Asp Ala Glu Gly Ala Glu Glu Ala Tyr Pro Glu Glu
435 440 445

taaaccagca gatcgtgttg cgttgttcgt ttctct 1380



<210> 2 
<211> 448 
<212> PRT 

<213> Cyathostomum coronatum 
<400> 2 

Met Arg Glu Ile Val His Val Gln Ala Gly Gln Cys Gly Asn Gln Ile
1 5 10 15

Gly Ser Lys Phe Trp Glu Val Ile Ser Asp Glu His Gly Ile Lys Pro
20 25 30

Asp Gly Thr Tyr His Gly Glu Ser Asp Leu Gln Leu Glu Arg Ile Asn
35 40 45

Val Tyr Tyr Asn Glu Ala His Gly Gly Lys Tyr Val Pro Arg Ala Val
50 55 60

Leu Val Asp Leu Glu Pro Gly Thr Met Asp Ser Val Arg Ser Gly Pro
65 70 75 80

Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr Val Phe Gly Gln Ser Gly
85 90 95

Ala Gly Asn Asn Trp Ala Lys Gly His Tyr Thr Glu Gly Ala Glu Leu
100 105 110

Val Asp Asn Val Leu Asp Val Val Arg Lys Glu Ala Glu Gly Cys Asp
115 120 125

Cys Leu Gln Gly Phe Gln Leu Thr His Ser Leu Gly Gly Gly Thr Gly
130 135 140

Ser Gly Met Gly Thr Leu Leu Ile Ser Lys Ile Arg Glu Glu Tyr Pro
145 150 155 160

Asp Arg Ile Met Ser Ser Phe Ser Val Val Pro Ser Pro Lys Val Ser
165 170 175

Asp Thr Val Val Glu Pro Tyr Asn Ala Thr Leu Ser Val His Gln Leu
180 185 190

Val Glu Asn Thr Asp Glu Thr Tyr Cys Ile Asp Asn Glu Ala Leu Tyr
195 200 205

Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr Asn Pro Thr Tyr Gly Asp
210 215 220

Leu Asn His Leu Val Ser Val Thr Met Ser Gly Val Thr Thr Cys Leu
225 230 235 240

Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu Arg Lys Leu Ala Val Asn
245 250 255

Met Val Pro Phe Pro Arg Leu His Phe Phe Met Pro Gly Phe Ala Pro
260 265 270

Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg Ala Leu Thr Val Ala Glu
275 280 285

Leu Thr Gln Gln Met Phe Asp Ala Lys Asn Met Met Ala Ala Cys Asp
290 295 300

Pro Arg His Gly Arg Tyr Leu Thr Val Ala Ala Met Phe Arg Gly Arg
305 310 315 320

Met Ser Met Arg Glu Val Asp Asp Gln Met Met Ser Val Gln Asn Lys
325 330 335

Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro Asn Asn Val Lys Thr Ala
340 345 350

Val Cys Asp Ile Pro Pro Arg Gly Leu Lys Met Ala Ala Thr Phe Val
355 360 365

Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe Lys Arg Ile Ser Glu Gln
370 375 380

Phe Thr Ala Met Phe Arg Arg Lys Ala Phe Leu His Trp Tyr Thr Gly
385 390 395 400

Glu Gly Met Asp Glu Met Glu Phe Thr Glu Ala Glu Ser Asn Met Asn
405 410 415

Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln Glu Ala Thr Ala Asp Asp
420 425 430

Met Gly Asp Leu Asp Ala Glu Gly Ala Glu Glu Ala Tyr Pro Glu Glu
435 440 445



<210> 3 
<211> 1429 
<212> DNA 

<213> Cylicocyclus nassatus 

<220> 

<221> CDS 

<222> (1)..(1362)

<400> 3 

aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga 48
Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

car tgt gga aac caa att ggy tcc aag tty tgg gaa gtg atc tct gac 96
Gln Cys Gly Asn Gln Ile Xaa Ser Lys Phe Trp Glu Val Ile Ser Asp
20 25 30

gag cac ggc att aag ccy gay ggc aca tac cay gga gaa tct gay yta 144
Glu His Gly Ile Lys Xaa Asp Gly Thr Tyr His Gly Glu Ser Asp Xaa
35 40 45

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aar 192
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat 240
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp
65 70 75 80

tcr gtc cgy tcy ggg cca tac ggg caa ttg ttc cgc cct gat aac tac 288
Xaa Val Xaa Xaa Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 336
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

act gaa ggy gct gaa ctt gtc gac aat gta cta gat gta gtg cga aaa 384
Thr Glu Xaa Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca 432
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

ctt gga gga ggt acc gga tcg rgt atg ggc acw ctc ctc atc tyc aaa 480
Leu Gly Gly Gly Thr Gly Ser Xaa Met Gly Xaa Leu Leu Ile Xaa Lys
145 150 155 160

att cgg gag gag tat cct gat aga atc atr tcc tcg ttc tyc gtt gtt 528
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Xaa Ser Ser Phe Xaa Val Val
165 170 175

ccc tca cca aag gtc tyc gay acy gtt gtg gag ccg tac aat gct acc 576
Pro Ser Pro Lys Val Xaa Asp Xaa Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

cta tcc gtt cat cag ttg gtt gaa aat aca gac gar act twc tgt att 624
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Xaa Cys Ile
195 200 205

gac aat gaa gct ctt tat gat att tgc ttc cgc acy ytg aaa ctc acs 672
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Xaa Xaa Lys Leu Xaa
210 215 220

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 720
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

ggy gtc act aca tgy ctt cgc ttc cct ggc caa ttg rry gcc gat ctw 768
Xaa Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Xaa Ala Asp Xaa
245 250 255

cgt aaa cta gct gtt aac atg gyt cca ttc cct cgt ctt cac tty tty 816
Arg Lys Leu Ala Val Asn Met Xaa Pro Phe Pro Arg Leu His Phe Phe
260 265 270

atg cct ggc ttt gct ccc ctc tct gcy aaa ggc gcy cag gct tac cgt 864
Met Pro Gly Phe Ala Pro Leu Ser Xaa Lys Gly Xaa Gln Ala Tyr Arg
275 280 285

gct ctt act gta gcc gag ctw acy caa yag atg ttc gat gcc aaa aat 912
Ala Leu Thr Val Ala Glu Xaa Xaa Gln Xaa Met Phe Asp Ala Lys Asn
290 295 300

atg atg gcc gct tgc gac cct cga cat gga crt tat ctc acc gty gca 960
Met Met Ala Ala Cys Asp Pro Arg His Gly Xaa Tyr Leu Thr Xaa Ala
305 310 315 320

gcc atg ttc cga gga cga atg agc ayg agg gar gta gac gac cag atg 1008
Ala Met Phe Arg Gly Arg Met Ser Xaa Arg Glu Val Asp Asp Gln Met
325 330 335

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg 1056
Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

aac aac gtc aar acc gcy gta tgc gac att ccg ccr aga gga ctg aaa 1104
Asn Asn Val Lys Thr Xaa Val Cys Asp Ile Pro Xaa Arg Gly Leu Lys
355 360 365

atg gcc gct acc ttc gtt gga aac yca act gcc aty caa gag ctg tty 1152
Met Ala Ala Thr Phe Val Gly Asn Xaa Thr Ala Xaa Gln Glu Leu Phe
370 375 380

aag cgc att tca gaa caa tty aca gct atg ttc cgc cgc aaa gcg tty 1200
Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

ttg cat ygg tay act ggw gaa ggt atg gay gag atg gag ttc act gaa 1248
Leu His Xaa Tyr Thr Xaa Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac car caa tac cag 1296
Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

gaa gct acm gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa 1344
Glu Ala Xaa Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445

gag gct tac cct gar gaa tagamcagca gaytgtgttg cgttgttcgt 1392
Glu Ala Tyr Pro Glu Glu
450

ttctctrtgt caatgcgaaa tacacattga ttgcgtt 1429



<210> 4 
<211> 454 



<212> PRT 

<213> Cylicocyclus nassatus 



<400> 4 

Lys Phe Ser Thr Ala lie Met Arg 
1 5 



Gin Cys Gly Asn Gin lie Xaa Ser 
20 



Glu His Gly He Lys Xaa Asp Gly 
35 40 



Glu He Val His Val Gin Ala Gly 
10 15 

Lys Phe Trp Glu Val He Ser Asp 
25 30 

Thr Tyr His Gly Glu Ser Asp Xaa 
45 



Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp

Xaa Val Xaa Xaa Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

Thr Glu Xaa Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140



Leu Gly Gly Gly Thr Gly Ser Xaa Met Gly Xaa Leu Leu Ile Xaa Lys
145 150 155 160

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Xaa Ser Ser Phe Xaa Val Val
165 170 175

Pro Ser Pro Lys Val Xaa Asp Xaa Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Xaa Cys Ile
195 200 205

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Xaa Xaa Lys Leu Xaa
210 215 220

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

Xaa Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Xaa Ala Asp Xaa
245 250 255

Arg Lys Leu Ala Val Asn Met Xaa Pro Phe Pro Arg Leu His Phe Phe
260 265 270

Met Pro Gly Phe Ala Pro Leu Ser Xaa Lys Gly Xaa Gln Ala Tyr Arg
275 280 285

Ala Leu Thr Val Ala Glu Xaa Xaa Gln Xaa Met Phe Asp Ala Lys Asn
290 295 300 

Met Met Ala Ala Cys Asp Pro Arg His Gly Xaa Tyr Leu Thr Xaa Ala 



Ala Met Phe Arg Gly Arg Met Ser Xaa Arg Glu Val Asp Asp Gln Met
325 330 335

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

Asn Asn Val Lys Thr Xaa Val Cys Asp Ile Pro Xaa Arg Gly Leu Lys
355 360 365

Met Ala Ala Thr Phe Val Gly Asn Xaa Thr Ala Xaa Gln Glu Leu Phe
370 375 380

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

Leu His Xaa Tyr Thr Xaa Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

Glu Ala Xaa Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445



Glu Ala Tyr Pro Glu Glu 



<210> 5 
<211> 1428 
<212> DNA 

<213> Cylicocyclus nassatus 



<220> 

<221> CDS 

<222> (1) . . (1362) 



<400> 5 

aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga 48
Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

cag tgt gga aac caa att ggc tcc aag ttt tgg gaa gtg atc tct gac 96
Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp
20 25 30



gag cac ggc att aag cct gat ggc aca tac cac gga gaa tct gat tta
Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aaa
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp
65 70 75 80

tcg gtc cgt tcc ggg cca tac ggg caa ttg ttc cgc cct gat aac tac 288
Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95 



gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 336
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

act gaa ggc gct gaa ctt gtt gac aat gta cta gat gta gtg cga aaa 384
Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca 432
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

ctt gga gga ggt acc gga tcg ggt atg ggc act ctc ctc atc tcc aaa 480
Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys
145 150 155 160

att cgg gag gag tat cct gat aga atc atg tcc tcg ttc tcc gtt gtt 528
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val
165 170 175

ccc tca cca aag gtc tcc gac acc gtt gtg gag ccg tac aat gct acc 576
Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

cta tcc gtt cat cag ttg gtt gaa aat aca gac gag act ttc tgt att 624
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205



gac aat gaa gct ctt tat gat att tgc ttc cgc act ttg aaa ctc acg 672
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 720
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

ggt gtc act aca tgt ctt cgc ttc cct ggc caa ttg aat gcc gat cta 768
Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu
245 250 255

cgt aaa cta gct gtt aac atg gtt cca ttc cct cgt ctt cac ttc ttt 816
Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270

atg cct ggc ttt gct ccc ctc tct gct aaa ggc gct cag gct tac cgt 864
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

gct ctt act gta gcc gag cta act caa cag atg ttc gat gcc aaa aat 912
Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

atg atg gcc gct tgc gac cct cga cat gga cgt tat ctc acc gtc gca 960
Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

gcc atg ttc cga gga cga atg agc atg agg gag gta gac gac cag atg
1008

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met

325 330 335 
atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg
1056

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

aac aac gtc aag acc gct gta tgc gac att ccg ccg aga gga ctg aaa
1104

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

atg gcc gct acc ttc gtt gga aac tca act gcc atc caa gag ctg ttc
1152

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

aag cgc att tca gaa caa ttc aca gct atg ttc cgc cgc aaa gcg ttc
1200

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

ttg cat tgg tat act ggt gaa ggt atg gac gag atg gag ttc act gaa
1248

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac cag caa tac cag
1296

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

gaa gct aca gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa
1344

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445



gag gct tac cct gaa gaa tagacagcag attgtgttgc gttgttcgtt
1392 

Glu Ala Tyr Pro Glu Glu 
450 

tctctgtgtc aatgcgaaat acacattgat tgcgtt 
1428 



<210> 6 
<211> 454 
<212> PRT 

<213> Cylicocyclus nassatus 
<400> 6 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp
20 25 30

Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu
35 40 45

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60 

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 



Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys
145 150 155 160

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val
165 170 175

Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240



Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu
245 250 255

Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415



Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430 



Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 



Glu Ala Tyr Pro Glu Glu 
450 



<210> 7 
<211> 1429 
<212> DNA 

<213> Cylicocyclus nassatus 

<220> 

<221> CDS 

<222> (1) . . (1362) 

<400> 7 

aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga 48
Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

cag tgt gga aac caa att ggt tcc aag ttt tgg gaa gtg atc tct gac 96
Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp
20 25 30



gag cac ggc att aag ccc gat ggc aca tac cac gga gaa tct gac tta 144
Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aaa 192
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat 240
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp

tcg gtc cgt tcc ggg cca tac ggg caa ttg ttc cgc cct gat aac tac
Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 336
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

act gaa ggc gct gaa ctt gtc gac aat gta cta gat gta gtg cga aaa 384
Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca 432
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

ctt gga gga ggt acc gga tcg agt atg ggc act ctc ctc atc ttc aaa 480
Leu Gly Gly Gly Thr Gly Ser Ser Met Gly Thr Leu Leu Ile Phe Lys
145 150 155 160



att cgg gag gag tat cct gat aga atc ata tcc tcg ttc ttc gtt gtt
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Ile Ser Ser Phe Phe Val Val
165 170 175

ccc tca cca aag gtc tcc gac acc gtt gtg gag ccg tac aat gct acc 576
Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

cta tcc gtt cat cag ttg gtt gaa aat aca gac gag act ttc tgt att 624
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

gac aat gaa gct ctt tat gat att tgc ttc cgc act ttg aaa ctc acg 672
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 720
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

ggt gtc act aca tgt ctt cgc ttc cct ggc caa ttg agt gcc gat cta 768
Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Ser Ala Asp Leu
245 250 255

cgt aaa cta gct gtt aac atg gtt cca ttc cct cgt ctt cac ttc ttt 816
Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270

atg cct ggc ttt gct ccc ctc tct gct aaa ggc gct cag gct tac cgt 864
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285
gct ctt act gta gcc gag cta act caa cag atg ttc gat gcc aaa aat
Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

atg atg gcc gct tgc gac cct cga cat gga cgt tat ctc acc gtc gca
Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

gcc atg ttc cga gga cga atg agc atg agg gag gta gac gac cag atg
1008

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg
1056

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

aac aac gtc aag acc gct gta tgc gac att ccg ccg aga gga ctg aaa
1104

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

atg gcc gct acc ttc gtt gga aac tca act gcc att caa gag ctg ttc
1152

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

aag cgc att tca gaa caa ttt aca gct atg ttc cgc cgc aaa gcg ttc
1200

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe



ttg cat tgg tat act ggt gaa ggt atg gac gag atg gag ttc act gaa 
1248 

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac caa caa tac cag
1296

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

gaa gct acc gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa
1344

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445

gag gct tac cct gag gaa tagaacagca gattgtgttg cgttgttcgt
1392

Glu Ala Tyr Pro Glu Glu
450

ttctctgtgt caatgcgaaa tacacattga ttgcgtt
1429 



<210> 8 
<211> 454 
<212> PRT 

<213> Cylicocyclus nassatus 
<400> 8 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp

Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu
35 40 45

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp
65 70 75 80

Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

Leu Gly Gly Gly Thr Gly Ser Ser Met Gly Thr Leu Leu Ile Phe Lys
145 150 155 160

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Ile Ser Ser Phe Phe Val Val
165 170 175 



Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 



Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Ser Ala Asp Leu
245 250 255

Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445

Glu Ala Tyr Pro Glu Glu
450



<210> 9 
<211> 1428 
<212> DNA 

<213> Cylicocyclus nassatus 

<220> 

<221> CDS 

<222> (1) . . (1362)



<400> 9 

aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

cag tgt gga aac caa att ggt tcc aag ttc tgg gaa gtg atc tct gac
Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp

gag cac ggc att aag ccc gac ggc aca tac cat gga gaa tct gat cta
Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aag 192
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat 240
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp

tca gtc cgt tct ggg cca tac ggg caa ttg ttc cgc cct gat aac tac 288
Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 336
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

act gaa ggc gct gaa ctt gtc gac aat gta cta gat gta gtg cga aaa
Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

ctt gga gga ggt acc gga tcg ggt atg ggc aca ctc ctc atc tcc aaa
Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys
145 150 155 160

att cgg gag gag tat cct gat aga atc atg tcc tcg ttc tcc gtt gtt 528
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val
165 170 175

ccc tca cca aag gtc ttc gat act gtt gtg gag ccg tac aat gct acc
Pro Ser Pro Lys Val Phe Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190 



cta tcc gtt cat cag ttg gtt gaa aat aca gac gag act ttc tgt att 624
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

gac aat gaa gct ctt tat gat att tgc ttc cgc acc ttg aaa ctc acg 672
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 720
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

ggt gtc act aca tgc ctt cgc ttc cct ggc caa ttg aat gcc gat cta 768
Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu
245 250 255

cgt aaa cta gct gtt aac atg gtt cca ttc cct cgt ctt cac ttc ttc 816
Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270



atg cct ggc ttt gct ccc ctc tct gcc aaa ggc gcc cag gct tac cgt
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

gct ctt act gta gcc gag cta act caa cag atg ttc gat gcc aaa aat
Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300 



atg atg gcc gct tgc gac cct cga cat gga cgt tat ctc acc gtc gca 960
Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

gcc atg ttc cga gga cga atg agc atg agg gag gta gac gac cag atg
1008

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg
1056

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

aac aac gtc aaa acc gct gta tgc gac att ccg ccg aga gga ctg aaa
1104

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys



atg gcc gct acc ttc gtt gga aac tca act gcc att caa gag ctg ttc
1152

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

aag cgc att tca gaa caa ttc aca gct atg ttc cgc cgc aaa gcg ttc
1200

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

ttg cat tgg tat act ggt gaa ggt atg gac gag atg gag ttc act gaa
1248

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac cag caa tac cag
1296

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

gaa gct acc gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa
1344

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445

gag gct tac cct gaa gaa tagacagcag attgtgttgc gttgttcgtt
1392 

Glu Ala Tyr Pro Glu Glu 
450 



tctctgtgtc aatgcgaaat acacattgat tgcgtt 
1428 
<210> 10
<211> 454 
<212> PRT 

<213> Cylicocyclus nassatus 
<400> 10 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly

Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp

Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp

Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110



Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 



Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys
145 150 155 160

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val
165 170 175

Pro Ser Pro Lys Val Phe Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu
245 250 255

Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445

Glu Ala Tyr Pro Glu Glu
450



<210> 11 
<211> 2655 
<212> DNA 

<213> Cylicocyclus nassatus 
<220> 

<221> intron 
<222> (1) . . (18) 

<220> 

<221> intron 
<222> (1) . . (18)

<220> 

<221> intron 
<222> (76) . . (358) 

<220> 

<221> intron 
<222> (469) . . (637) 

<220> 

<221> intron 

<222> (865) . . (1374) 

<220> 

<221> intron 

<222> (1666) . . (1723) 
<220>

<221> intron 

<222> (1915) . . (1966) 

<220> 

<221> intron 

<222> (2064) . . (2119) 

<220> 

<221> intron 

<222> (2306) . . (2354) 

<220> 

<221> intron 

<222> (2475) . . (2523) 

<220> 

<221> intron 

<222> (2592) . . (2655) 

<400> 11 

aagttctcta ctgcaataat gcgtgagatc gtgcatgtac aagctggaca rtgtggaaac 60 
caaattggtt ccaaggtrcg gtagtttyrr twktytrytg atcgtaattc sggmgytytr 120
dagtrryttt ttycgytgyy ratgttgcat yrtgttgcga taaascyraa aawtcawwag 180
rcgaggctgt aaaagsactt ytacttttra atmcrytgta gcagcatgag tcatcrgcat 240
gtttgcagtg sgttttttat gcgcwgawcc yctcagaaga tgagaatgcg wtccaytgag 300 
cwtagartct grctttctct cgttawctaa ratcaamtta carcrytyca ttttkcagtt 360 
ytgggaagtg atctctgacg agcacggcat taagccygay ggcacatacc ayggagaatc 420 
tgatytacaa ttagaacgaa tcaatgtgta ctataatgaa gcacatggtt agtcgtayat 480 
ccgcttcgtt gtytcccmat gcagrccyct tcagttttta taactgycga aatatcgatc 540 
gggctctttt gcagcggccw ygattacgca ataccayygc ygcygcagtg gcrgtcgaaa 600

ttaatgtggt caracgtgaa aatgtggtgc tttyaggagg caartatgtc ccgcgtgcag 660 

ttcttgttga tctcgagccc ggaactatgg attcggtccg ytccgggcca tacgggcaat 720

tgttccgccc tgataactac gtgtttggac agtctggcgc aggaaataac tgggcaaaag 780 

gtcactacac tgaaggygct gaacttgtcg acaatgtact agatgtagtg cgaaaagaag 840 

ctgaaggatg tgactgtctg caggtaaatt tccaagtagt agcaggaaat ggtwtgtgra 900 

tagcataaca aaagtcatag aaggaatatg gacgctagtc aaaacaaagw tggacgttar 960 

tcggtcgtcc gggacarttt ggaagtcatg ggtcasccaa cacgcttttt tamaagtaca 
1020 

tcatactctt ttcccacgaa aagctatttt gcgtattacg gggtacaggg gaggggtcaa 
1080 

aatcacagat tgctgaaaty tggttcactg ragttattgr tgaaaatcat attgattttg 
1140 

cttgctactg ccttcttttr aggctatgct ttacaatctt ggggcctgga taaccgaatt 
1200 

gtcygaagtt tttcggtcat cacggacggg gaaggggcat artatcgtta kttcttgkta 
1260 

tttcgcagca tatggcaatc tytccacttc tgacaagttt tcygtagaaa atatwcttca 
1320 

aggtstcaag aacyttgctg ctagrgctgt aaaccaayct gtatcycttt cagggcttcc 
1380 

agctaactca ctcacttgga ggaggtaccg gatcgggtat gggcactctc ctcatctcca 
1440 

aaattcggga ggagtatcct gatagaatca tgtcctcgtt ctccgttgtt ccctcaccaa 
1500 

aggtctccga caccgttgtg gagccgtaca atgctaccct atccgttcat cagttggttg 
1560 

aaaatacaga cgaracttwc tgtattgaca atgaagctct ttatgatatt tgcttccgca 
1620 

cyytgaaact cacsaaccca acttatggag atctgaatca tcttggtrrg yrayatkcsa 
1680 

ytgctgagct tdgtrgaatt tvctaattwt ktyhamtdty yagtgtctgt aacaatgtct 
1740 

ggygtcacta catgycttcg cttccctggc caattgrayg ccgatctwcg taaactagct 
1800 

gttaacatgg ytccattccc tcgtcttcac ttyttyatgc ctggctttgc tcccctctct 
1860 

gcyaaaggcg cycaggctta ccgtgctctt actgtagccg agctwacyca rcaggtgcgt 
1920 
ctgcttatcr ttgwtgayrt gtgtttattc kttgtrtatt ttayagatgt tcgatgccaa
1980 

aaatatgatg gccgcttgcg accctcgaca tggacrttat ctcaccgtyg cagccatgtt 
2040 

ccgaggacga atgagcayga gggtaagtgg mtkmttggyc ytytaryaya rctcrgacga 
2100 

awtgctgtta tgtcmtagga rgtagacgac cagatgatgt cagtgcagaa caagaactcc 
2160 

tcatacttcg tagagtggat tccgaacaac gtcaaraccg cygtatgcga cattccgccr 
2220 

agaggactga aaatggccgc taccttcgtt ggaaacycaa ctgccatcca agagctgtty 
2280 

aagcgcattt cagaacaatt yacaggttdg tttgtgcaya ttatggtgaa agcagattar 
2340 

ttgcgaygtt gcagctatgt tccgccgcaa agcgttyttg catyggtaya ctggwgaagg 
2400 

tatggaygag atggagttca ctgaagccga gtccaacatg aatgatctca tctccgaata 
2460 

ccarcaatac caggttcggc tgtytttcwt rgayactgtr tttaataatt wttyttgtct 
2520 

aggaagctac cgctgacgat atgggcgatc tcgatgcgga aggcgctgaa gaggcttacc 
2580 

ctgargaata gamcagcaga ytgtgttgcg ttgttcgttt ctctrtgtca atgcgaaata 
2640 

cacattgatt gcgtt 
2655 



<210> 12 
<211> 23 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: hybridization probe/primer 



<400> 12 



ggtttaatta tcccaagttt gag 



<210> 13 
<211> 20 
<212> DNA 

<213> Synthetic sequence 

<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 13 

ggccacgcgt cgactagtac 

<210> 14 
<211> 37 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 14 

ggccacgcgt cgactagtac tttttttttt ttttttt 

<210> 15 
<211> 22 



<212> DNA 

<213> Synthetic sequence 

<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 15 

gaccgctgta tgcgacattc cg

<210> 16 
<211> 22 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 16 

aactcaactg ccatccaaga gc 

<210> 17 
<211> 21 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 



sequence: primer/hybridization probes 



<400> 17 

gctatgttcc gccgcaaagc g 

<210> 18 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes

<400> 18 

acgagcacgg cattaagcct gatgg 

<210> 19 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes

<400> 19 

ccatcaggct taatgccgtg ctcgt 



<210> 20 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 20 

ccgaatccat agttccgggc tcgag 



<210> 21 
<211> 24 
<212> DNA 

<213> Synthetic sequence 

<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 21 

ccgacaccgt tgtggagccg taca 



<210> 22 
<211> 25 
<212> DNA 

<213> Synthetic sequence
<220>

<223> Description of the synthetic

sequence: primer/hybridization probes

<400> 22 

gcgaccctcg acatggacgt tatct 

<210> 23 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 23 

agataacgtc catgtcgagg gtcgc 

<210> 24 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 



<400> 24 

tgttccgagg acgaatgagc atgag 

<210> 25 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 25 

ctcatgctca ttcgtcctcg gaaca 

<210> 26 
<211> 26 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 26 

aggtagacga ccagatgatg tcagtg 



<210> 27 



<211> 26 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 27 

cactgacatc atctggtcgt ctacct 26 



<210> 28 
<211> 24 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 28 

ggcggaatgt cgcatacagc ggtc 



<210> 29 
<211> 26 
<212> DNA 

<213> Synthetic sequence 
<220> 
<223> Description of the synthetic

sequence: primer/hybridization probes 



<400> 29 

cggagatgag atcattcatg ttggac 



<210> 30 
<211> 24 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 30 

ctctcctcat ctccaaaatt cggg 



<210> 31 
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 31 

cagctaactc actcacttgg aggag 
<210> 32
<211> 25 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 32 

aagttctcta ctgcaataat gcgtg 



<210> 33 
<211> 24 
<212> DNA 

<213> Synthetic sequence 




<220>

<223> Description of the synthetic

sequence: primer/hybridization probes
<400> 33 

ggttgaaaat acagacgaga cttt 



<210> 34
<211> 24
<212> DNA

<213> Synthetic sequence

<220>

<223> Description of the synthetic

sequence: primer/hybridization probes

<400> 34

ggttgaaaat acagacgaga ctta 24

<210> 35
<211> 24
<212> DNA

<213> Synthetic sequence
<220>

<223> Description of the synthetic

sequence: primer/hybridization probes

<400> 35

aaagagcttc attgtcaata caga 24

<210> 36
<211> 23
<212> DNA

<213> Synthetic sequence


<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 



<400> 36 

caattggcca gggaagcgaa gac 



23 



<210> 37 
<211> 23 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 37 

gtcttcgctt ccctggccaa ttg 

<210> 38 
<211> 24 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 



<400> 38 

cctggctttg ctcccctctc tgct 24 



<210> 39 
<211> 24 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 39 

agcagagagg ggagcaaagc cagg 

<210> 40 
<211> 23 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 40 

aacgcaatca atgtgtattt cgc 

<210> 41 

<211> 18 

<212> DNA 

<213> Synthetic sequence 



<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 41 

cccgacggca cataccat 



<210> 42 
<211> 24 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 42 

gaaacgaaca acgcaacaca atct 24 



<210> 43 
<211> 22 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 43 

caagctggac aatgtggaaa cc 22 



<210> 44 
<211> 22 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 44 

yagagaaacg aacaacgcaa ca 



<210> 45 
<211> 23 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 45 

ttgatctcga gcccggaact atg 23 



<210> 46 
<211> 22 



<212> DNA 

<213> Synthetic sequence 

<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probe 

<400> 46 

tctcgagccc ggaactatgg at 

<210> 47 
<211> 24 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probe 

<400> 47 

cccgaatttt ggagatgagg agag 

<210> 48 

<211> 20 

<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 



sequence: primer/hybridization probes 



<400> 48 

ggncartgyg gnaaycarat 

<210> 49 
<211> 20 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probes 

<400> 49 

gaytgyytnc arggnttyca 

<210> 50 
<211> 20 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probe 

<400> 50 

tgraanccyt gnarrcartc 



<210> 51 
<211> 20 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic 

sequence: primer/hybridization probe 



<400> 51 

tcytgrtayt gytgrtaytc 



SEQUENCE LISTING 

<110> von Samson-Himmelstjerna, Georg 
Harder, Achim 
Schneider, Thomas 
Pape, Michaela 

<120> DNA Coding for Beta-Tubulin and Use Thereof 

<130> Mo6878/LeA 33 759 

<140> US 10/030,566 
<141> 2002-01-07 

<150> PCT/EP00/06104 
<151> 2000-06-30 

<150> DE19931883.2 
<151> 1999-07-09 

<160> 2 

<170> PatentIn version 3.1 

<210> 1 

<211> 1380 

<212> DNA 

<213> Cyathostomum coronatum 

<220> 

<221> CDS 

<222> (1)..(1344) 

<223> 

<400> 1 

atg cgt gag atc gtg cat gta caa gct gga caa tgt gga aac caa att 48 
Met Arg Glu Ile Val His Val Gln Ala Gly Gln Cys Gly Asn Gln Ile 
1 5 10 15 

ggt tcc aag ttt tgg gaa gtg atc tct gac gag cat ggc att aag ccc 96 
Gly Ser Lys Phe Trp Glu Val Ile Ser Asp Glu His Gly Ile Lys Pro 
20 25 30 

gat ggc aca tac cac gga gaa tct gat cta caa tta gaa cga atc aat 144 
Asp Gly Thr Tyr His Gly Glu Ser Asp Leu Gln Leu Glu Arg Ile Asn 
35 40 45 

gtg tac tat aat gaa gca cat gga ggc aaa tat gtc cca cgt gca gtt 192 
Val Tyr Tyr Asn Glu Ala His Gly Gly Lys Tyr Val Pro Arg Ala Val 
50 55 60 

ctt gtt gat ctc gag ccc gga act atg gat tcc gtc cgt tcc ggg cca 240 
Leu Val Asp Leu Glu Pro Gly Thr Met Asp Ser Val Arg Ser Gly Pro 
65 70 75 80 

tac ggg caa ttg ttc cgc cct gat aac tac gtg ttt gga cag tct ggc 288 
Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr Val Phe Gly Gln Ser Gly 
85 90 95 

gca gga aat aac tgg gca aaa ggt cac tac act gaa ggc gct gaa ctt 336 
Ala Gly Asn Asn Trp Ala Lys Gly His Tyr Thr Glu Gly Ala Glu Leu 
100 105 110 

gtc gac aat gta cta gat gta gtg cga aaa gaa gca gaa gga tgt gac 
Val Asp Asn Val Leu Asp Val Val Arg Lys Glu Ala Glu Gly Cys Asp 
115 120 125 

tgt ctg cag ggc ttc cag cta act cac tca ctt gga gga ggt acc ggt 
Cys Leu Gln Gly Phe Gln Leu Thr His Ser Leu Gly Gly Gly Thr Gly 
130 135 140 

tcg ggt atg ggc act ctc ctc atc tcc aaa att cgg gag gag tat cct 
Ser Gly Met Gly Thr Leu Leu Ile Ser Lys Ile Arg Glu Glu Tyr Pro 
145 150 155 160 

gat aga atc atg tcc tcg ttc tcc gtt gtc ccc tca cca aag gtc tcc 
Asp Arg Ile Met Ser Ser Phe Ser Val Val Pro Ser Pro Lys Val Ser 
165 170 175 

gac act gtt gtg gag cct tac aat gct acc cta tcc gtt cat cag ttg 
Asp Thr Val Val Glu Pro Tyr Asn Ala Thr Leu Ser Val His Gln Leu 
180 185 190 

gtt gaa aat aca gac gag act tat tgt att gac aat gaa gcc ctg tat 
Val Glu Asn Thr Asp Glu Thr Tyr Cys Ile Asp Asn Glu Ala Leu Tyr 
195 200 205 

gat att tgc ttc cgc act ttg aaa ctc acg aac cca act tat gga gat 
Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr Asn Pro Thr Tyr Gly Asp 
210 215 220 

ctg aat cat ctt gtg tct gta aca atg tct ggt gtc acc aca tgt ctt 
Leu Asn His Leu Val Ser Val Thr Met Ser Gly Val Thr Thr Cys Leu 
225 230 235 240 

cgc ttc cct ggc caa ttg aat gcc gat cta cgc aaa cta gct gtt aac 
Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu Arg Lys Leu Ala Val Asn 
245 250 255 

atg gtt cca ttc cct cgt ctt cac ttc ttc atg cct ggt ttt gct cct 
Met Val Pro Phe Pro Arg Leu His Phe Phe Met Pro Gly Phe Ala Pro 
260 265 270 

ctt tct gct aaa ggt gct cag gct tac cgt gct ctt acc gta gcc gag 
Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg Ala Leu Thr Val Ala Glu 
275 280 285 

ctt aca cag cag atg ttt gat gct aag aat atg atg gct gct tgc gac 
Leu Thr Gln Gln Met Phe Asp Ala Lys Asn Met Met Ala Ala Cys Asp 
290 295 300 

cct cga cat gga cgt tat ctc acc gtc gca gcc atg ttc cga gga aga 
Pro Arg His Gly Arg Tyr Leu Thr Val Ala Ala Met Phe Arg Gly Arg 
305 310 315 320 

atg agc atg agg gaa gta gac gac cag atg atg tca gtg cag aac aag 
Met Ser Met Arg Glu Val Asp Asp Gln Met Met Ser Val Gln Asn Lys 
325 330 335 

aac tcc tca tac ttc gta gag tgg atc ccg aac aac gtg aag acc gct 
Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro Asn Asn Val Lys Thr Ala 
340 345 350 

gta tgc gac atc ccg cca cga gga ctg aag atg gcc gct acc ttc gtt 
Val Cys Asp Ile Pro Pro Arg Gly Leu Lys Met Ala Ala Thr Phe Val 
355 360 365 

gga aac tca act gcc atc caa gag ctg ttc aag cgc att tca gaa caa 
Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe Lys Arg Ile Ser Glu Gln 
370 375 380 

ttt aca gcc atg ttc cgc cgc aaa gcg ttc ttg cat tgg tac act ggt 
Phe Thr Ala Met Phe Arg Arg Lys Ala Phe Leu His Trp Tyr Thr Gly 
385 390 395 400 

gaa ggt atg gac gag atg gag ttc act gaa gca gag tcc aac atg aat 
Glu Gly Met Asp Glu Met Glu Phe Thr Glu Ala Glu Ser Asn Met Asn 
405 410 415 

gat ctc atc tcc gag tac caa cag tac cag gaa gcc acc gct gac gac 
Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln Glu Ala Thr Ala Asp Asp 
420 425 430 

atg ggc gat ctt gat gcg gaa ggc gct gaa gag gct tat cct gag gaa 
Met Gly Asp Leu Asp Ala Glu Gly Ala Glu Glu Ala Tyr Pro Glu Glu 
435 440 445 

taaaccagca gatcgtgttg cgttgttcgt ttctct 

<210> 2 

<211> 448 

<212> PRT 

<213> Cyathostomum coronatum 

<400> 2 

Met Arg Glu Ile Val His Val Gln Ala Gly Gln Cys Gly Asn Gln Ile 
1 5 10 15 

Gly Ser Lys Phe Trp Glu Val Ile Ser Asp Glu His Gly Ile Lys Pro 
20 25 30 

Asp Gly Thr Tyr His Gly Glu Ser Asp Leu Gln Leu Glu Arg Ile Asn 
35 40 45 

Val Tyr Tyr Asn Glu Ala His Gly Gly Lys Tyr Val Pro Arg Ala Val 
50 55 60 

Leu Val Asp Leu Glu Pro Gly Thr Met Asp Ser Val Arg Ser Gly Pro 
65 70 75 80 

Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr Val Phe Gly Gln Ser Gly 
85 90 95 

Ala Gly Asn Asn Trp Ala Lys Gly His Tyr Thr Glu Gly Ala Glu Leu 
100 105 110 

Val Asp Asn Val Leu Asp Val Val Arg Lys Glu Ala Glu Gly Cys Asp 
115 120 125 

Cys Leu Gln Gly Phe Gln Leu Thr His Ser Leu Gly Gly Gly Thr Gly 
130 135 140 

Ser Gly Met Gly Thr Leu Leu Ile Ser Lys Ile Arg Glu Glu Tyr Pro 
145 150 155 160 

Asp Arg Ile Met Ser Ser Phe Ser Val Val Pro Ser Pro Lys Val Ser 
165 170 175 

Asp Thr Val Val Glu Pro Tyr Asn Ala Thr Leu Ser Val His Gln Leu 
180 185 190 

Val Glu Asn Thr Asp Glu Thr Tyr Cys Ile Asp Asn Glu Ala Leu Tyr 
195 200 205 

Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr Asn Pro Thr Tyr Gly Asp 
210 215 220 

Leu Asn His Leu Val Ser Val Thr Met Ser Gly Val Thr Thr Cys Leu 
225 230 235 240 

Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu Arg Lys Leu Ala Val Asn 
245 250 255 

Met Val Pro Phe Pro Arg Leu His Phe Phe Met Pro Gly Phe Ala Pro 
260 265 270 

Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg Ala Leu Thr Val Ala Glu 
275 280 285 

Leu Thr Gln Gln Met Phe Asp Ala Lys Asn Met Met Ala Ala Cys Asp 
290 295 300 

Pro Arg His Gly Arg Tyr Leu Thr Val Ala Ala Met Phe Arg Gly Arg 
305 310 315 320 

Met Ser Met Arg Glu Val Asp Asp Gln Met Met Ser Val Gln Asn Lys 
325 330 335 

Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro Asn Asn Val Lys Thr Ala 
340 345 350 

Val Cys Asp Ile Pro Pro Arg Gly Leu Lys Met Ala Ala Thr Phe Val 
355 360 365 

Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe Lys Arg Ile Ser Glu Gln 
370 375 380 

Phe Thr Ala Met Phe Arg Arg Lys Ala Phe Leu His Trp Tyr Thr Gly 
385 390 395 400 

Glu Gly Met Asp Glu Met Glu Phe Thr Glu Ala Glu Ser Asn Met Asn 
405 410 415 

Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln Glu Ala Thr Ala Asp Asp 
420 425 430 

Met Gly Asp Leu Asp Ala Glu Gly Ala Glu Glu Ala Tyr Pro Glu Glu 
435 440 445 



<210> 3 
<211> 1429 
<212> DNA 
<213> Cylicocyclus nassatus 

<220> 
<221> CDS 
<222> (1)..(1362) 
<223> Xaa is a variable amino acid. 



<400> 3 

aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga 
Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly 
1 5 10 15 

car tgt gga aac caa att ggy tcc aag tty tgg gaa gtg atc tct gac 
Gln Cys Gly Asn Gln Ile Xaa Ser Lys Phe Trp Glu Val Ile Ser Asp 
20 25 30 

gag cac ggc att aag ccy gay ggc aca tac cay gga gaa tct gay yta 
Glu His Gly Ile Lys Xaa Asp Gly Thr Tyr His Gly Glu Ser Asp Xaa 
35 40 45 

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aar 
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys 
50 55 60 

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat 
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 

tcr gtc cgy tcy ggg cca tac ggg caa ttg ttc cgc cct gat aac tac 
Xaa Val Arg Xaa Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr 
85 90 95 

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr 
100 105 110 

act gaa ggy gct gaa ctt gtc gac aat gta cta gat gta gtg cga aaa 
Thr Glu Xaa Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca 
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser 
130 135 140 

ctt gga gga ggt acc gga tcg rgt atg ggc acw ctc ctc atc tyc aaa 480 
Leu Gly Gly Gly Thr Gly Ser Xaa Met Gly Xaa Leu Leu Ile Xaa Lys 
145 150 155 160 

att cgg gag gag tat cct gat aga atc atr tcc tcg ttc tyc gtt gtt 528 
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Xaa Ser Ser Phe Xaa Val Val 
165 170 175 

ccc tca cca aag gtc tyc gay acy gtt gtg gag ccg tac aat gct acc 576 
Pro Ser Pro Lys Val Xaa Asp Xaa Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 

cta tcc gtt cat cag ttg gtt gaa aat aca gac gar act twc tgt att 624 
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Xaa Cys Ile 
195 200 205 

gac aat gaa gct ctt tat gat att tgc ttc cgc acy ytg aaa ctc acs 672 
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Xaa Xaa Lys Leu Xaa 
210 215 220 

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 720 
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser 
225 230 235 240 

ggy gtc act aca tgy ctt cgc ttc cct ggc caa ttg rry gcc gat ctw 768 
Xaa Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Xaa Ala Asp Xaa 
245 250 255 

cgt aaa cta gct gtt aac atg gyt cca ttc cct cgt ctt cac tty tty 816 
Arg Lys Leu Ala Val Asn Met Xaa Pro Phe Pro Arg Leu His Phe Phe 
260 265 270 

atg cct ggc ttt gct ccc ctc tct gcy aaa ggc gcy cag gct tac cgt 864 
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg 
275 280 285 

gct ctt act gta gcc gag ctw acy caa yag atg ttc gat gcc aaa aat 912 
Ala Leu Thr Val Ala Glu Xaa Xaa Gln Xaa Met Phe Asp Ala Lys Asn 
290 295 300 

atg atg gcc gct tgc gac cct cga cat gga crt tat ctc acc gty gca 960 
Met Met Ala Ala Cys Asp Pro Arg His Gly Xaa Tyr Leu Thr Xaa Ala 
305 310 315 320 

gcc atg ttc cga gga cga atg agc ayg agg gar gta gac gac cag atg 1008 
Ala Met Phe Arg Gly Arg Met Ser Xaa Arg Glu Val Asp Asp Gln Met 
325 330 335 

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg 1056 
Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro 
340 345 350 

aac aac gtc aar acc gcy gta tgc gac att ccg ccr aga gga ctg aaa 1104 
Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Xaa Arg Gly Leu Lys 
355 360 365 

atg gcc gct acc ttc gtt gga aac yca act gcc aty caa gag ctg tty 1152 
Met Ala Ala Thr Phe Val Gly Asn Xaa Thr Ala Xaa Gln Glu Leu Phe 
370 375 380 

aag cgc att tca gaa caa tty aca gct atg ttc cgc cgc aaa gcg tty 1200 
Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe 
385 390 395 400 

ttg cat ygg tay act ggw gaa ggt atg gay gag atg gag ttc act gaa 
Leu His Xaa Tyr Thr Xaa Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac car caa tac cag 
Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln 
420 425 430 

gaa gct acm gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa 
Glu Ala Xaa Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

gag gct tac cct gar gaa tagamcagca gaytgtgttg cgttgttcgt 
Glu Ala Tyr Pro Glu Glu 
450 

ttctctrtgt caatgcgaaa tacacattga ttgcgtt 



<210> 4 
<211> 454 
<212> PRT 
<213> Cylicocyclus nassatus 

<220> 
<221> misc_feature 
<222> (23)..(23) 
<223> The 'Xaa' at location 23 stands for Gly. 

<220> 
<221> misc_feature 
<222> (38)..(38) 
<223> The 'Xaa' at location 38 stands for Pro. 

<220> 
<221> misc_feature 
<222> (48)..(48) 
<223> The 'Xaa' at location 48 stands for Leu. 

<220> 
<221> misc_feature 
<222> (81)..(81) 
<223> The 'Xaa' at location 81 stands for Ser. 

<220> 
<221> misc_feature 
<222> (84)..(84) 
<223> The 'Xaa' at location 84 stands for Ser. 

<220> 
<221> misc_feature 
<222> (115)..(115) 
<223> The 'Xaa' at location 115 stands for Gly. 

<220> 
<221> misc_feature 
<222> (152)..(152) 
<223> The 'Xaa' at location 152 stands for Gly, or Ser. 

<220> 
<221> misc_feature 
<222> (155)..(155) 
<223> The 'Xaa' at location 155 stands for Thr. 

<220> 
<221> misc_feature 
<222> (159)..(159) 
<223> The 'Xaa' at location 159 stands for Ser, or Phe. 

<220> 
<221> misc_feature 
<222> (170)..(170) 
<223> The 'Xaa' at location 170 stands for Met, or Ile. 

<220> 
<221> misc_feature 
<222> (174)..(174) 
<223> The 'Xaa' at location 174 stands for Ser, or Phe. 

<220> 
<221> misc_feature 
<222> (182)..(182) 
<223> The 'Xaa' at location 182 stands for Ser, or Phe. 

<220> 
<221> misc_feature 
<222> (184)..(184) 
<223> The 'Xaa' at location 184 stands for Thr. 

<220> 
<221> misc_feature 
<222> (206)..(206) 
<223> The 'Xaa' at location 206 stands for Tyr, or Phe. 

<220> 
<221> misc_feature 
<222> (220)..(220) 
<223> The 'Xaa' at location 220 stands for Thr. 

<220> 
<221> misc_feature 
<222> (221)..(221) 
<223> The 'Xaa' at location 221 stands for Leu. 

<220> 
<221> misc_feature 
<222> (224)..(224) 
<223> The 'Xaa' at location 224 stands for Thr. 

<220> 
<221> misc_feature 
<222> (241)..(241) 
<223> The 'Xaa' at location 241 stands for Gly. 

<220> 
<221> misc_feature 
<222> (253)..(253) 
<223> The 'Xaa' at location 253 stands for Gly, Asp, Ser, or Asn. 

<220> 
<221> misc_feature 
<222> (256)..(256) 
<223> The 'Xaa' at location 256 stands for Leu. 

<220> 
<221> misc_feature 
<222> (264)..(264) 
<223> The 'Xaa' at location 264 stands for Ala, or Val. 

<220> 
<221> misc_feature 
<222> (295)..(295) 
<223> The 'Xaa' at location 295 stands for Leu. 

<220> 
<221> misc_feature 
<222> (296)..(296) 
<223> The 'Xaa' at location 296 stands for Thr. 

<220> 
<221> misc_feature 
<222> (298)..(298) 
<223> The 'Xaa' at location 298 stands for Gln. 

<220> 
<221> misc_feature 
<222> (315)..(315) 
<223> The 'Xaa' at location 315 stands for Arg, or His. 

<220> 
<221> misc_feature 
<222> (319)..(319) 
<223> The 'Xaa' at location 319 stands for Val. 

<220> 
<221> misc_feature 
<222> (329)..(329) 
<223> The 'Xaa' at location 329 stands for Thr, or Met. 

<220> 
<221> misc_feature 
<222> (364)..(364) 
<223> The 'Xaa' at location 364 stands for Pro. 

<220> 
<221> misc_feature 
<222> (377)..(377) 
<223> The 'Xaa' at location 377 stands for Pro, or Ser. 

<220> 
<221> misc_feature 
<222> (380)..(380) 
<223> The 'Xaa' at location 380 stands for Ile. 

<220> 
<221> misc_feature 
<222> (403)..(403) 
<223> The 'Xaa' at location 403 stands for Arg, or Trp. 

<220> 
<221> misc_feature 
<222> (406)..(406) 
<223> The 'Xaa' at location 406 stands for Gly. 

<220> 
<221> misc_feature 
<222> (435)..(435) 
<223> The 'Xaa' at location 435 stands for Thr. 

<400> 4 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly 
1 5 10 15 

Gln Cys Gly Asn Gln Ile Xaa Ser Lys Phe Trp Glu Val Ile Ser Asp 
20 25 30 

Glu His Gly Ile Lys Xaa Asp Gly Thr Tyr His Gly Glu Ser Asp Xaa 
35 40 45 

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys 
50 55 60 

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 

Xaa Val Arg Xaa Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr 
85 90 95 

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr 
100 105 110 

Thr Glu Xaa Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser 
130 135 140 

Leu Gly Gly Gly Thr Gly Ser Xaa Met Gly Xaa Leu Leu Ile Xaa Lys 
145 150 155 160 

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Xaa Ser Ser Phe Xaa Val Val 
165 170 175 

Pro Ser Pro Lys Val Xaa Asp Xaa Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Xaa Cys Ile 
195 200 205 

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Xaa Xaa Lys Leu Xaa 
210 215 220 

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser 
225 230 235 240 

Xaa Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Xaa Ala Asp Xaa 
245 250 255 

Arg Lys Leu Ala Val Asn Met Xaa Pro Phe Pro Arg Leu His Phe Phe 
260 265 270 

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg 
275 280 285 

Ala Leu Thr Val Ala Glu Xaa Xaa Gln Xaa Met Phe Asp Ala Lys Asn 
290 295 300 

Met Met Ala Ala Cys Asp Pro Arg His Gly Xaa Tyr Leu Thr Xaa Ala 
305 310 315 320 

Ala Met Phe Arg Gly Arg Met Ser Xaa Arg Glu Val Asp Asp Gln Met 
325 330 335 

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro 
340 345 350 

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Xaa Arg Gly Leu Lys 
355 360 365 

Met Ala Ala Thr Phe Val Gly Asn Xaa Thr Ala Xaa Gln Glu Leu Phe 
370 375 380 

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe 
385 390 395 400 

Leu His Xaa Tyr Thr Xaa Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln 
420 425 430 

Glu Ala Xaa Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

Glu Ala Tyr Pro Glu Glu 
450 



<210> 5 

<211> 1428 

<212> DNA 

<213> Cylicocyclus nassatus 
<220> 

<221> CDS 

<222> (1)..(1362) 

<223> 

<400> 5 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly 
1 5 10 15 

cag tgt gga aac caa att ggc tcc aag ttt tgg gaa gtg atc tct gac 
Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp 
20 25 30 

gag cac ggc att aag cct gat ggc aca tac cac gga gaa tct gat tta 
Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu 
35 40 45 

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aaa 
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys 
50 55 60 

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat 
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 

tcg gtc cgt tcc ggg cca tac ggg caa ttg ttc cgc cct gat aac tac 
Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr 
85 90 95 

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr 
100 105 110 

act gaa ggc gct gaa ctt gtt gac aat gta cta gat gta gtg cga aaa 
Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca 
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser 
130 135 140 

ctt gga gga ggt acc gga tcg ggt atg ggc act ctc ctc atc tcc aaa 
Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys 
145 150 155 160 

att cgg gag gag tat cct gat aga atc atg tcc tcg ttc tcc gtt gtt 
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val 
165 170 175 

ccc tca cca aag gtc tcc gac acc gtt gtg gag ccg tac aat gct acc 
Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 

cta tcc gtt cat cag ttg gtt gaa aat aca gac gag act ttc tgt att 
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile 
195 200 205 

gac aat gaa gct ctt tat gat att tgc ttc cgc act ttg aaa ctc acg 
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr 
210 215 220 

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser 
225 230 235 240 

ggt gtc act aca tgt ctt cgc ttc cct ggc caa ttg aat gcc gat cta 
Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu 
245 250 255 

cgt aaa cta gct gtt aac atg gtt cca ttc cct cgt ctt cac ttc ttt 
Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe 
260 265 270 

atg cct ggc ttt gct ccc ctc tct gct aaa ggc gct cag gct tac cgt 
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg 
275 280 285 

gct ctt act gta gcc gag cta act caa cag atg ttc gat gcc aaa aat 
Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn 
290 295 300 

atg atg gcc gct tgc gac cct cga cat gga cgt tat ctc acc gtc gca 
Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala 
305 310 315 320 

gcc atg ttc cga gga cga atg agc atg agg gag gta gac gac cag atg 
Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met 
325 330 335 

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg 
Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro 
340 345 350 

aac aac gtc aag acc gct gta tgc gac att ccg ccg aga gga ctg aaa 
Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys 
355 360 365 

atg gcc gct acc ttc gtt gga aac tca act gcc atc caa gag ctg ttc 
Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe 
370 375 380 

aag cgc att tca gaa caa ttc aca gct atg ttc cgc cgc aaa gcg ttc 
Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe 
385 390 395 400 

ttg cat tgg tat act ggt gaa ggt atg gac gag atg gag ttc act gaa 
Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac cag caa tac cag 
Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln 
420 425 430 

gaa gct aca gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa 
Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

gag gct tac cct gaa gaa tagacagcag attgtgttgc gttgttcgtt 
Glu Ala Tyr Pro Glu Glu 
450 

tctctgtgtc aatgcgaaat acacattgat tgcgtt 

<210> 6 

<211> 454 

<212> PRT 

<213> Cylicocyclus nassatus 

<400> 6 






Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly 
1 5 10 15 

Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp 
20 25 30 

Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu 
35 40 45 

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys 
50 55 60 

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 

Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr 
85 90 95 

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr 
100 105 110 

Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser 
130 135 140 

Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys 
145 150 155 160 

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val 
165 170 175 

Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile 
195 200 205 

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr 
210 215 220 

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser 
225 230 235 240 

Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu 
245 250 255 

Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe 
260 265 270 

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg 
275 280 285 

Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn 
290 295 300 

Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala 
305 310 315 320 

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met 
325 330 335 

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro 
340 345 350 

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys 
355 360 365 

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe 
370 375 380 

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe 
385 390 395 400 

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln 
420 425 430 

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

Glu Ala Tyr Pro Glu Glu 
450 



<210> 7 

<211> 1429 

<212> DNA 

<213> Cylicocyclus nassatus 



<220> 

<221> CDS 

<222> (1)..(1362) 

<223> 

<400> 7 

aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga 
Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly 
1 5 10 15 

cag tgt gga aac caa att ggt tcc aag ttt tgg gaa gtg atc tct gac 
Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp 
20 25 30 

gag cac ggc att aag ccc gat ggc aca tac cac gga gaa tct gac tta 
Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu 
35 40 45 

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aaa 
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys 
50 55 60 

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat 
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 

tcg gtc cgt tcc ggg cca tac ggg caa ttg ttc cgc cct gat aac tac 
Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr 
85 90 95 

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac 
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr 
100 105 110 

act gaa ggc gct gaa ctt gtc gac aat gta cta gat gta gtg cga aaa 
Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca 
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser 
130 135 140 

ctt gga gga ggt acc gga tcg agt atg ggc act ctc ctc atc ttc aaa 
Leu Gly Gly Gly Thr Gly Ser Ser Met Gly Thr Leu Leu Ile Phe Lys 
145 150 155 160 

att cgg gag gag tat cct gat aga atc ata tcc tcg ttc ttc gtt gtt 
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Ile Ser Ser Phe Phe Val Val 
165 170 175 

ccc tca cca aag gtc tcc gac acc gtt gtg gag ccg tac aat gct acc 
Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 

cta tcc gtt cat cag ttg gtt gaa aat aca gac gag act ttc tgt att 
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile 
195 200 205 

gac aat gaa gct ctt tat gat att tgc ttc cgc act ttg aaa ctc acg 
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr 
210 215 220 

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct 
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser 
225 230 235 240 

ggt gtc act aca tgt ctt cgc ttc cct ggc caa ttg agt gcc gat cta 
Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Ser Ala Asp Leu 
245 250 255 

cgt aaa cta gct gtt aac atg gtt cca ttc cct cgt ctt cac ttc ttt 
Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe 
260 265 270 

atg cct ggc ttt gct ccc ctc tct gct aaa ggc gct cag gct tac cgt 
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg 
275 280 285 

gct ctt act gta gcc gag cta act caa cag atg ttc gat gcc aaa aat 
Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn 
290 295 300 

atg atg gcc gct tgc gac cct cga cat gga cgt tat ctc acc gtc gca 
Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala 
305 310 315 320 

gcc atg ttc cga gga cga atg agc atg agg gag gta gac gac cag atg 
Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met 
325 330 335 

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg 
Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro 
340 345 350 

aac aac gtc aag acc gct gta tgc gac att ccg ccg aga gga ctg aaa 
Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys 
355 360 365 

atg gcc gct acc ttc gtt gga aac tca act gcc att caa gag ctg ttc 
Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe 
370 375 380 

aag cgc att tca gaa caa ttt aca gct atg ttc cgc cgc aaa gcg ttc 
Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe 
385 390 395 400 

ttg cat tgg tat act ggt gaa ggt atg gac gag atg gag ttc act gaa 
Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac caa caa tac cag 
Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln 
420 425 430 

gaa gct acc gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa 
Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

gag gct tac cct gag gaa tagaacagca gattgtgttg cgttgttcgt 
Glu Ala Tyr Pro Glu Glu 
450 

ttctctgtgt caatgcgaaa tacacattga ttgcgtt 

<210> 8 
<211> 454 
<212> PRT 

<213> Cylicocyclus nassatus 
<400> 8 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly 
1 5 10 15 

Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp 
20 25 30 

Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu 
35 40 45 

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys 
50 55 60 

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp 
65 70 75 80 

Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr 
85 90 95 

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr 
100 105 110 

Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys 
115 120 125 

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser 
130 135 140 

Leu Gly Gly Gly Thr Gly Ser Ser Met Gly Thr Leu Leu Ile Phe Lys 
145 150 155 160 

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Ile Ser Ser Phe Phe Val Val 
165 170 175 

Pro Ser Pro Lys Val Ser Asp Thr Val Val Glu Pro Tyr Asn Ala Thr 
180 185 190 

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile 
195 200 205 

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr 
210 215 220 

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser 
225 230 235 240 

Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Ser Ala Asp Leu 
245 250 255 

Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe 
260 265 270 

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg 
275 280 285 

Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn 
290 295 300 

Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala 
305 310 315 320 

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met 
325 330 335 

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro 
340 345 350 

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys 
355 360 365 

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe 
370 375 380 

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe 
385 390 395 400 

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu 
405 410 415 

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln 
420 425 430 

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

Glu Ala Tyr Pro Glu Glu 
450 



<210> 9 

<211> 1428 

<212> DNA 

<213> Cylicocyclus nassatus 
<220> 

<221> CDS 

<222> (1)..(1362) 

<223> 

<400> 9 



aag ttc tct act gca ata atg cgt gag atc gtg cat gta caa gct gga
Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

cag tgt gga aac caa att ggt tcc aag ttc tgg gaa gtg atc tct gac
Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp
20 25 30

gag cac ggc att aag ccc gac ggc aca tac cat gga gaa tct gat cta
Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu
35 40 45

caa tta gaa cga atc aat gtg tac tat aat gaa gca cat gga ggc aag
Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60

tat gtc ccg cgt gca gtt ctt gtt gat ctc gag ccc gga act atg gat
Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp
65 70 75 80

tca gtc cgt tct ggg cca tac ggg caa ttg ttc cgc cct gat aac tac
Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

gtg ttt gga cag tct ggc gca gga aat aac tgg gca aaa ggt cac tac
Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

act gaa ggc gct gaa ctt gtc gac aat gta cta gat gta gtg cga aaa
Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

gaa gct gaa gga tgt gac tgt ctg cag ggc ttc cag cta act cac tca
Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

ctt gga gga ggt acc gga tcg ggt atg ggc aca ctc ctc atc tcc aaa
Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys
145 150 155 160

att cgg gag gag tat cct gat aga atc atg tcc tcg ttc tcc gtt gtt
Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val
165 170 175

ccc tca cca aag gtc ttc gat act gtt gtg gag ccg tac aat gct acc
Pro Ser Pro Lys Val Phe Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

cta tcc gtt cat cag ttg gtt gaa aat aca gac gag act ttc tgt att
Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

gac aat gaa gct ctt tat gat att tgc ttc cgc acc ttg aaa ctc acg
Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

aac cca act tat gga gat ctg aat cat ctt gtg tct gta aca atg tct
Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

ggt gtc act aca tgc ctt cgc ttc cct ggc caa ttg aat gcc gat cta
Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu
245 250 255

cgt aaa cta gct gtt aac atg gtt cca ttc cct cgt ctt cac ttc ttc
Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270 

atg cct ggc ttt gct ccc ctc tct gcc aaa ggc gcc cag gct tac cgt
Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

gct ctt act gta gcc gag cta act caa cag atg ttc gat gcc aaa aat
Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

atg atg gcc gct tgc gac cct cga cat gga cgt tat ctc acc gtc gca
Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

gcc atg ttc cga gga cga atg agc atg agg gag gta gac gac cag atg
Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

atg tca gtg cag aac aag aac tcc tca tac ttc gta gag tgg att ccg
Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

aac aac gtc aaa acc gct gta tgc gac att ccg ccg aga gga ctg aaa
Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

atg gcc gct acc ttc gtt gga aac tca act gcc att caa gag ctg ttc
Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

aag cgc att tca gaa caa ttc aca gct atg ttc cgc cgc aaa gcg ttc
Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

ttg cat tgg tat act ggt gaa ggt atg gac gag atg gag ttc act gaa
Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

gcc gag tcc aac atg aat gat ctc atc tcc gaa tac cag caa tac cag
Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430

gaa gct acc gct gac gat atg ggc gat ctc gat gcg gaa ggc gct gaa
Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu
435 440 445

gag gct tac cct gaa gaa tagacagcag attgtgttgc gttgttcgtt
Glu Ala Tyr Pro Glu Glu 
450 

tctctgtgtc aatgcgaaat acacattgat tgcgtt 

<210> 10 

<211> 454 

<212> PRT 

<213> Cylicocyclus nassatus 

<400> 10 

Lys Phe Ser Thr Ala Ile Met Arg Glu Ile Val His Val Gln Ala Gly
1 5 10 15

Gln Cys Gly Asn Gln Ile Gly Ser Lys Phe Trp Glu Val Ile Ser Asp
20 25 30

Glu His Gly Ile Lys Pro Asp Gly Thr Tyr His Gly Glu Ser Asp Leu
35 40 45

Gln Leu Glu Arg Ile Asn Val Tyr Tyr Asn Glu Ala His Gly Gly Lys
50 55 60

Tyr Val Pro Arg Ala Val Leu Val Asp Leu Glu Pro Gly Thr Met Asp
65 70 75 80

Ser Val Arg Ser Gly Pro Tyr Gly Gln Leu Phe Arg Pro Asp Asn Tyr
85 90 95

Val Phe Gly Gln Ser Gly Ala Gly Asn Asn Trp Ala Lys Gly His Tyr
100 105 110

Thr Glu Gly Ala Glu Leu Val Asp Asn Val Leu Asp Val Val Arg Lys
115 120 125

Glu Ala Glu Gly Cys Asp Cys Leu Gln Gly Phe Gln Leu Thr His Ser
130 135 140

Leu Gly Gly Gly Thr Gly Ser Gly Met Gly Thr Leu Leu Ile Ser Lys
145 150 155 160

Ile Arg Glu Glu Tyr Pro Asp Arg Ile Met Ser Ser Phe Ser Val Val
165 170 175

Pro Ser Pro Lys Val Phe Asp Thr Val Val Glu Pro Tyr Asn Ala Thr
180 185 190

Leu Ser Val His Gln Leu Val Glu Asn Thr Asp Glu Thr Phe Cys Ile
195 200 205

Asp Asn Glu Ala Leu Tyr Asp Ile Cys Phe Arg Thr Leu Lys Leu Thr
210 215 220

Asn Pro Thr Tyr Gly Asp Leu Asn His Leu Val Ser Val Thr Met Ser
225 230 235 240

Gly Val Thr Thr Cys Leu Arg Phe Pro Gly Gln Leu Asn Ala Asp Leu
245 250 255

Arg Lys Leu Ala Val Asn Met Val Pro Phe Pro Arg Leu His Phe Phe
260 265 270

Met Pro Gly Phe Ala Pro Leu Ser Ala Lys Gly Ala Gln Ala Tyr Arg
275 280 285

Ala Leu Thr Val Ala Glu Leu Thr Gln Gln Met Phe Asp Ala Lys Asn
290 295 300

Met Met Ala Ala Cys Asp Pro Arg His Gly Arg Tyr Leu Thr Val Ala
305 310 315 320

Ala Met Phe Arg Gly Arg Met Ser Met Arg Glu Val Asp Asp Gln Met
325 330 335

Met Ser Val Gln Asn Lys Asn Ser Ser Tyr Phe Val Glu Trp Ile Pro
340 345 350

Asn Asn Val Lys Thr Ala Val Cys Asp Ile Pro Pro Arg Gly Leu Lys
355 360 365

Met Ala Ala Thr Phe Val Gly Asn Ser Thr Ala Ile Gln Glu Leu Phe
370 375 380

Lys Arg Ile Ser Glu Gln Phe Thr Ala Met Phe Arg Arg Lys Ala Phe
385 390 395 400

Leu His Trp Tyr Thr Gly Glu Gly Met Asp Glu Met Glu Phe Thr Glu
405 410 415

Ala Glu Ser Asn Met Asn Asp Leu Ile Ser Glu Tyr Gln Gln Tyr Gln
420 425 430 

Glu Ala Thr Ala Asp Asp Met Gly Asp Leu Asp Ala Glu Gly Ala Glu 
435 440 445 

Glu Ala Tyr Pro Glu Glu 
450 



<210> 11 

<211> 2655 

<212> DNA 

<213> Cylicocyclus nassatus 
<220>
<221> intron
<222> (1)..(18)
<223>

<220>
<221> intron
<222> (76)..(358)
<223>

<220>
<221> intron
<222> (496)..(637)
<223>

<220>
<221> intron
<222> (865)..(1374)
<223>

<220>
<221> intron
<222> (1666)..(1723)
<223>

<220>
<221> intron
<222> (1915)..(1966)
<223>

<220>
<221> intron
<222> (2064)..(2119)
<223>

<220>
<221> intron
<222> (2306)..(2354)
<223>

<220>
<221> intron
<222> (2475)..(2523)
<223>

<220>
<221> intron
<222> (2592)..(2655)
<223>

<400> 11

aagttctcta ctgcaataat gcgtgagatc gtgcatgtac aagctggaca rtgtggaaac 60
caaattggtt ccaaggtrcg gtagtttyrr twktytrytg atcgtaattc sggtngytytr 120
dagtrryttt ttycgytgyy ratgttgcat yrtgttgcga taaascyraa aawtcawwag 180
rcgaggctgt aaaagsactt ytacttttra atmcrytgta gcagcatgag tcatcrgcat 240
gtttgcagtg sgttttttat gcgcwgawcc yctcagaaga tgagaatgcg wtccaytgag 300
cwtagartct grctttctct cgttawctaa ratcaamtta carcrytyca ttttkcagtt 360
ytgggaagtg atctctgacg agcacggcat taagccygay ggcacatacc ayggagaatc 420
tgatytacaa ttagaacgaa tcaatgtgta ctataatgaa gcacatggtt agtcgtayat 480
ccgcttcgtt gtytcccmat gcagrccyct tcagttttta taactgycga aatatcgatc 540
gggctctttt gcagcggccw ygattacgca ataccayygc ygcygcagtg gcrgtcgaaa 600
ttaatgtggt caracgtgaa aatgtggtgc tttyaggagg caartatgtc ccgcgtgcag 660
ttcttgttga txtcgagccc ggaactatgg attcggtccg ytccgggcca tacgggcaat 720
tgttccgccc tgataactac gtgtttggac agtctggcgc aggaaataac tgggcaaaag 780
gtcactacac tgaaggygct gaacttgtcg acaatgtact agatgtagtg cgaaaagaag 840
ctgaaggatg tgactgtctg caggtaaatt tccaagtagt agcaggaaat ggtwtgtgra 900
tagcataaca aaagtcatag aaggaatatg gacgctagtc aaaacaaagw tggacgttar 960
tcggtcgtcc gggacarttt ggaagtcatg ggtcasccaa cacgcttttt tamaagtaca 1020
tcatactctt ttcccacgaa aagctatttt gcgtattacg gggtacaggg gaggggtcaa 1080
aatcacagat tgctgaaaty tggttcactg ragttattgr tgaaaatcat attgattttg 1140
cttgctactg ccttcttttr aggctatgct ttacaatctt ggggcctgga taaccgaatt 1200
gtcygaagtt tttcggtcat cacggacggg gaaggggcat artatcgtta kttcttgkta 1260
tttcgcagca tatggcaatc tytccacttc tgacaagttt tcygtagaaa atatwcttca 1320
aggtstcaag aacyttgctg ctagrgctgt aaaccaayct gtatcycttt cagggcttcc 1380
agctaactca ctcacttgga ggaggtaccg gatcgggtat gggcactctc ctcatctcca 1440
aaattcggga ggagtatcct gatagaatca tgtcctcgtt ctccgttgtt ccctcaccaa 1500
aggtctccga caccgttgtg gagccgtaca atgctaccct atccgttcat cagttggttg 1560
aaaatacaga cgaracttwc tgtattgaca atgaagctct ttatgatatt tgcttccgca 1620
cyytgaaact cacsaaccca acttatggag atctgaatca tcttggtrrg yrayatkcsa 1680
ytgctgagct tdgtrgaatt tvctaattwt ktyhamtdty yagtgtctgt aacaatgtct 1740
ggygtcacta catgycttcg cttccctggc caattgrayg ccgatctwcg taaactagct 1800
gttaacatgg ytccattccc tcgtcttcac ttyttyatgc ctggctttgc tcccctctct 1860
gcyaaaggcg cycaggctta ccgtgctctt actgtagccg agctwacyca rcaggtgcgt 1920
ctgcttatcr ttgwtgayrt gtgtttattc kttgtrtatt ttayagatgt tcgatgccaa 1980
aaatatgatg gccgcttgcg accctcgaca tggacrttat ctcaccgtyg cagccatgtt 2040
ccgaggacga atgagcayga gggtaagtgg mtkrnttggyc ytytaryaya rctcrgacga 2100
awtgctgtta tgtcmtagga rgtagacgac cagatgatgt cagtgcagaa caagaactcc 2160
tcatacttcg tagagtggat tccgaacaac gtcaaraccg cygtatgcga cattccgccr 2220
agaggactga aaatggccgc taccttcgtt ggaaacycaa ctgccatcca agagctgtty 2280
aagcgcattt cagaacaatt yacaggttdg tttgtgcaya ttatggtgaa agcagattar 2340
ttgcgaygtt gcagctatgt tccgccgcaa agcgttyttg catyggtaya ctggwgaagg 2400
tatggaygag atggagttca ctgaagccga gtccaacatg aatgatctca tctccgaata 2460
ccarcaatac caggttcggc tgtytttcwt rgayactgtr tttaataatt wttyttgtct 2520
aggaagctac cgctgacgat atgggcgatc tcgatgcgga aggcgctgaa gaggcttacc 2580
ctgargaata gamcagcaga ytgtgttgcg ttgttcgttt ctctrtgtca atgcgaaata 2640
cacattgatt gcgtt 2655



<210> 48 

<211> 20 

<212> DNA 

<213> Synthetic sequence 
<220> 

<221> misc_feature

<222> (1)..(20)

<223> r is a purine (adenine or guanine); y is a pyrimidine (cytosine, 
thymine, or uracil) 

<220> 

<221> misc_feature 

<222> (1)..(20)

<223> n is adenine, cytosine, guanine, thymine, or uracil 

<400> 48 

ggncartgyg gnaaycarat 20 



<210> 49 

<211> 20 

<212> DNA 

<213> Synthetic sequence 
<220> 

<221> misc_feature 

<222> (1)..(20)

<223> r is a purine (adenine or guanine); y is a pyrimidine (cytosine, 
thymine, or uracil) 

<220> 

<221> misc_feature 

<222> (1)..(20)

<223> n is adenine, cytosine, guanine, thymine, or uracil 

<400> 49 

gaytgyytnc arggnttyca 20



<210> 50 

<211> 20 

<212> DNA 

<213> Synthetic sequence 
<220> 

<221> misc_feature

<222> (1)..(20)

<223> r is a purine (adenine or guanine); y is a pyrimidine (cytosine,
thymine, or uracil)

<220>

<221> misc_feature

<222> (1)..(20)

<223> n is adenine, cytosine, guanine, thymine, or uracil

<400> 50

tgraanccyt gnarrcartc 20



